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NOVEL METHODS OF DIAGNOSIS OF METASTATIC COLORECTAL 
CANCER, COMPOSITIONS AND METHODS OF SCREENING FOR 
MODULATORS OF METASTATIC COLORECTAL CANCER 

CROSS-REFERENCES TO RELATED APPLICATIONS 
The present application is related to USSN 60/272,206, filed February 27, 
2001, USSN 60/281,149, filed April 2, 2001, and USSN 60/284,555, filed April 17, 2001, all 
of which are herein incorporated by referenced in their entirety. 

FIELD OF THE INVENTION 
The invention relates to the identification of nucleic acid and protein 
expression profiles and nucleic acids, products, and antibodies thereto that are involved in 
metastatic colorectal cancer; and to the use of such expression profiles and compositions in 
diagnosis and therapy of metastatic colorectal cancer. The invention further relates to 
methods for identifying and using agents and/or targets that inhibit metastatic colorectal 
cancer. 

BACKGROUND OF THE INVENTION 
Cancer of the colon and/or rectum (referred to as "colorectal cancer") are 
significant in Western populations and particularly in the United States. Cancers of thexolon 
and rectum occur in both men and women most commonly after the age of 50. These 
develop as the result of a pathologic transformation of normal colon epithelium to an invasive 
cancer. There have been a number of recently characterized genetic alterations that have 
been implicated in colorectal cancer, including mutations in two classes of genes, tumor- 
suppressor genes and proto-oncogenes, with recent work suggesting that mutations in DNA 
repair genes may also be involved in tumorigenesis. For example, inactivating mutations of 
both alleles of the adenomatous polyposis coli (APC) gene, a tumor suppressor gene, appears 
to be one of the earliest events in colorectal cancer, and may even be the initiating event. 
Other genes implicated in colorectal cancer include the MCC gene, the p53 gene, the DCC 
(deleted in colorectal carcinoma) gene and other chromosome 18q genes, and genes in the 
TGF-p signaling pathway. For a review, see Molecular Biology of Colorectal Cancer, pp. 
238-299, in Curr. Probl Cancer, Sept/Oct 1997; see also Willams, Colorectal Cancer 



l 



WO 02/068677 



PCT/US02/06001 



(1996); Kinsella & Schofield, Colorectal Cancer: A Scientific Perspective (1993); Colorectal 
Cancer: Molecular Mechanisms, Premalignant State and its Prevention (Schmiegel & 
Scholmerich eds., 2000); Colorectal Cancer: New Aspects of Molecular Biology and Their 
Clinical Applications (Hanski et al. 9 eds 2000); McArdle et al; Colorectal Cancer (2000); 
Wanebo, Colorectal Cancer (1993); Levin, The American Cancer Society: Colorectal Cancer 
(1999); Treatment of Hepatic Metastases of Colorectal Cancer (Nordlinger & Jaeck eds., 
1993); Management of Colorectal Cancer (Dunitz et al. 9 eds. 1998); Cancer: Principles and 
Practice of Oncology (Devita et al 9 eds. 2001); Surgical Oncology: Contemporary Principles 
and Practice (Kiiby et al, eds. 2001); Offit, Clinical Cancer Genetics: Risk Counseling and 
Management (1997); Radioimmunotherapy of Cancer (Abrams & Fritzberg eds. 2000); 
Fleming, AJCC Cancer Staging Handbook (1998); Textbook of Radiation Oncology (Leibel 
& Phillips eds. 2000); and Clinical Oncology (Abeloff al, eds. 2000). 

Imaging of colorectal cancer for diagnosis has been problematic and limited. 
In addition, metastasis of the tumor to the lumen, and metastasis of tumor cells to regional 
lymph nodes are important prognostic factors {see, e.g., PET in Oncology: Basics and 
Clinical Application (Ruhlmann et al eds. 1999). For example, five year survival rates drop 
from 80 percent in patients with no lymph node metastases to 45 to 50 percent in those 
patients who do have lymph node metastases. A recent report showed that micrometastases 
can be detected from lymph nodes using reverse transcriptase-PCR methods based on the 
presence of raRNA for carcinoembryonic antigen, which has previously been shown to be 
present in the vast majority of colorectal cancers but not in normal tissues. Liefers et al, New 
England! of Med. 339(4):223 (1998). In addition, colorectal cancers often metastasize to 
the liver. However, the lack of information about the gene expression exhibited by these 
cancers limits the ability to effectively diagnose and treat the disease. 

Thus, methods for diagnosis and prognosis of metastatic colorectal cancer and 
effective treatment of colorectal cancer would be desirable. Accordingly, provided herein are 
methods that can be used in diagnosis and prognosis of metastatic colorectal cancer. Further 
provided are methods that can be used to screen candidate therapeutic agents for the ability to 
modulate, e.g., treat, colorectal cancer. Additionally, provided herein are molecular targets 
and compositions for therapeutic intervention in metastatic colorectal disease and other 
metastatic cancers. 

SUMMARY OF THE INVENTION 
The present invention therefore provides nucleotide sequences of genes that are up- and 
down-regulated in metastatic colorectal cancer cells. Such genes and the proteins they 
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encode are useful for diagnostic and prognostic purposes, and also as targets for screening for 
therapeutic compounds that modulate metastatic colorectal cancer, such as antibodies. The 
methods of detecting nucleic acids of the invention or their encoded proteins can be used for 
a number of purposes. Examples include, early detection of colon cancers, monitoring and 
early detection of relapse following treatment of colon cancers, monitoring response to 
therapy of colon cancers, determining prognosis of colon cancers, directing therapy of colon 
cancers, selecting patients for postoperative chemotherapy or radiation therapy, selecting 
therapy, determining tumor prognosis, treatment, or response to treatment, and early 
detection of precancerous colon adenomas. Other aspects of the invention will become 
apparent to the skilled artisan by the following description of the invention. 

In one aspect, the present invention provides a method of detecting a 
metastatic colorectal cancer-associated transcript in a cell from a patient, the method 
comprising contacting a biological sample from the patient with a polynucleotide that 
selectively hybridizes to a sequence at least 80% identical to a sequence as shown in Tables 
1-26. 

In one embodiment, the polynucleotide selectively hybridizes to a sequence at 
least 95% identical to a sequence as shown in Tables 1-26. In another embodiment, the 
polynucleotide comprises a sequence as shown in Tables 1-26. 

In one embodiment, the biological sample is a tissue sample. In another 
embodiment, the biological sample comprises isolated nucleic acids, e.g., mRNA. 

In one embodiment, the polynucleotide is labeled, e.g., with a fluorescent 

label. 

In one embodiment, the polynucleotide is immobilized on a solid surface. 

In one embodiment, the patient is undergoing a therapeutic regimen to treat 
metastatic colorectal cancer. In another embodiment, the patient is suspected of having 
metastatic colorectal cancer. 

In one embodiment, the patient is a human. 

In one embodiment, the method further comprises the step of amplifying 
nucleic acids before the step of contacting the biological sample with the polynucleotide. 

In another aspect, the present invention provides methods of detecting 
polypeptide encoded by a metastatic colorectal cancer-associated transcript in a cell from a 
patient, the method comprising contacting a biological sample from the patient with an 
antibody that specifically binds a polypeptide encoded by a sequence at least 80% identical to 
a sequence as shown in Tables 1 -26. . . 
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In another aspect, the present invention provides a method of monitoring the 
efficacy of a therapeutic treatment of metastatic colorectal cancer, the method comprising the 
steps of: (i) providing a biological sample from a patient undergoing the therapeutic 
treatment; and (ii) determining the level of a metastatic colorectal cancer-associated transcript 
in the biological sample by contacting the biological sample with a polynucleotide that 
selectively hybridizes to a sequence at least 80% identical to a sequence as shown in Tables 
1-26., thereby monitoring the efficacy of the therapy. 

In one embodiment, the method further comprises the step of: (iii) comparing 
the level of the metastatic colorectal cancer-associated transcript to a level of the metastatic 
colorectal cancer-associated transcript in a biological sample from the patient prior to, or 
earlier in, the therapeutic treatment. 

In another aspect, the present invention provides a method of monitoring the 
efficacy of a therapeutic treatment of metastatic colorectal cancer, the method comprising the 
steps of: (i) providing a biological sample from a patient undergoing the therapeutic 
treatment; and (ii) determining the level of a metastatic colorectal cancer-associated antibody 
in the biological sample by contacting the biological sample with a polypeptide encoded by a 
polynucleotide that selectively hybridizes to a sequence at least 80% identical to a sequence 
as shown in Tables 1-26, wherein the polypeptide specifically binds to the metastatic 
colorectal cancer-associated antibody, thereby monitoring the efficacy of the therapy. 

In one embodiment, the method further comprises the step of: (iii) comparing 
the level of the metastatic colorectal cancer-associated antibody to a level of the metastatic 
colorectal cancer-associated antibody in a biological sample from the patient prior to, or 
earlier in, the therapeutic treatment. 

In another aspect, the present invention provides a method of monitoring the 
efficacy of a therapeutic treatment of metastatic colorectal cancer, the method comprising the 
steps of: (i) providing a biological sample from a patient undergoing the therapeutic 
treatment; and (ii) determining the level of a metastatic colorectal cancer-associated 
polypeptide in the biological sample by contacting the biological sample with an antibody, 
wherein the antibody specifically binds to a polypeptide encoded by a polynucleotide that 
selectively hybridizes to a sequence at least 80% identical to a sequence as shown in Tables 
1 -26, thereby monitoring the efficacy of the therapy. 

In one embodiment, the method further comprises the step of: (iii) comparing 
the level of the metastatic colorectal cancer-associated polypeptide to a level of the metastatic 
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colorectal cancer-associated polypeptide in a biological sample from the patient prior to, or 
earlier in, the therapeutic treatment. 

In one aspect, the present invention provides an isolated nucleic acid molecule 
consisting of a polynucleotide sequence as shown in Tables 1-26. 

In one embodiment, an expression vector or cell comprises the isolated nucleic 

acid. 

In one aspect, the present invention provides an isolated polypeptide which is 
encoded by a nucleic acid molecule having polynucleotide sequence as shown in Tables 1-26. 

In another aspect, the present invention provides an antibody that specifically 
binds to an isolated polypeptide which is encoded by a nucleic acid molecule having 
polynucleotide sequence as shown in Tables 1-26. 

In one embodiment, the antibody is conjugated to an effector component, e.g., 
a fluorescent label, a radioisotope or a cytotoxic chemical. 

In one embodiment, the antibody is an antibody fragment. In another 
embodiment, the antibody is humanized. 

In one aspect, the present invention provides a method of detecting a 
metastatic colorectal cancer cell in a biological sample from a patient, the method comprising 
contacting the biological sample with an antibody as described herein. 

In another aspect, the present invention provides a method of detecting 
antibodies specific to metastatic colorectal cancer in a patient, the method comprising 
contacting a biological sample from the patient with a polypeptide encoded by a nucleic acid 
comprises a sequence from Tables 1-26. 

In another aspect, the present invention provides a method for identifying a 
compound that modulates a metastatic colorectal cancer-associated polypeptide, the method 
comprising the steps of: (i) contacting the compound with a metastatic colorectal cancer- 
associated polypeptide, the polypeptide encoded by a polynucleotide that selectively 
hybridizes to a sequence at least 80% identical to a sequence as shown in Tables 1-26; and 
(ii) determining the functional effect of the compound upon the polypeptide. 

In one embodiment, the functional effect is a physical effect, an en2ymatic 
effect, or a chemical effect. 

In one embodiment, the polypeptide is expressed in a eukaryotic host cell or 
cell membrane. In another embodiment, the polypeptide is recombinant. 

-In one embodiment, the functional effect is determined by measuring ligand 
binding to the polypeptide. 
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In another aspect, the present invention provides a method of inhibiting 
proliferation of a metastatic colorectal cancer-associated cell to treat colorectal cancer in a 
patient, the method comprising the step of administering to the subject a therapeutically 
effective amount of a compound identified as described herein. 

In one embodiment, the compound is an antibody. 

In another aspect, the present invention provides a drug screening assay 
comprising the steps of: (i) administering a test compound to a mammal having colorectal 
cancer or a cell isolated therefrom; (ii) comparing the level of gene expression of a 
polynucleotide that selectively hybridizes to a sequence at least 80% identical to a sequence 
as shown in Tables 1-26. in a treated cell or mammal with the level of gene expression of the 
polynucleotide in a control cell or mammal, wherein a test compound that modulates the level 
of expression of the polynucleotide is a candidate for the treatment of colorectal cancer. 

In one embodiment, the control is a mammal with colorectal cancer or a cell 
therefrom that has not been treated with the test compound. In another embodiment, the 
control is a normal cell or mammal. 

In another aspect, the present invention provides a method for treating a 
mammal having colorectal cancer comprising administering a compound identified by the 
assay described herein. 

hi another aspect, the present invention provides a pharmaceutical 
composition for treating a mammal having colorectal cancer, the composition comprising a 
compound identified by the assay described herein and a physiologically acceptable 
excipient. 

DETAILED DESCRIPTION OF THE INVENTION 
In accordance with the objects outlined above, the present invention provides 
novel methods for diagnosis and treatment of colon and/or rectal cancer (e.g., colorectal 
cancer), including metastatic colorectal cancers, as well as methods for screening for 
compositions which modulate colorectal cancer. By "metastatic colorectal cancer" herein is 
meant a colon and/or rectal tumor or cancer that is classified as Dukes stage C or D (see, e.g., 
Cohen et al, Cancer of the Colon, in Cancer: Principles and Practice of Oncology, pp. 1 144- 
1 197 (Devita et al, eds., 5 th ed. 1997); see also Harrison 's Principles of Internal Medicine, 
pp. 1289-129 (Wilson et al, eds., 12 th ed. 5 1991). "Treatment, monitoring, detection or 
modulation of metastatic colorectal cancer" includes treatment, monitoring, detection, or 
modulation of metastatic colorectal disease in those patients who have metastatic colorectal 
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disease (Dukes stage C or D). In Dukes stage A, the tumor has penetrated into, but not 
through, the bowel wall In Dukes stage B, the tumor has penetrated through the bowel wall 
but there is not yet any lymph involvement. In Dukes stage C, the cancer involves regional 
lymph nodes. In Dukes stage D, there is distant metastasis, e.g., liver, lung, etc. 

Tables. 1-26 provide UniGene cluster identification numbers for the nucleotide 
sequence of genes that exhibit increased or decreased expression in metastasizing colorectal 
cancer samples. Tables 1-26 also provide an exemplar accession number that provides a 
nucleotide sequence that is part of the UniGene cluster. In Tables 1-26, the ratio provided 
represents primary tumor samples from known Dukes B stage survivors vs. liver metastasis 
samples from patients with metastatic colorectal cancer. In these samples, the identified 
genes are underexpressed in the metastatic samples, as the ratio is greater than one, preferably 
1.5 or greater, more preferably 2.0 or greater. In Tables 1-26, the ratio provided represents 
liver metastasis samples from patients with known metastatic colorectal cancer vs. known 
primary tumor samples from Dukes B stage survivors. In these samples, the identified genes 
are overexpressed in the metastatic samples, as the ratio is greater than one, preferably 1 .5 or 
greater, more preferably 2.0 or greater. In Tables 1-26, the ratio provided represents primary 
tumor samples from known Dukes B stage survivors vs. liver metastasis samples from 
patients with metastatic colorectal cancer. In these samples, the identified genes are 
overexpressed in the metastatic samples, as the ratio is less than one, preferably 0.5 or less, 
more preferably 0.25 or less. Survivors are subjects who have been disease free for five years 
or longer. 

In Tables 1-26, the ratio provided represents liver metastasis samples from 
patients with known metastatic disease vs. tissue samples from normal colon tissue. In these 
samples, the identified genes are overexpressed in the metastatic samples, as the ratio is 
greater than one, preferably 1 .5 or greater, more preferably 2.0 or greater. In Tables 1-26, the 
ratio represents liver metastasis samples from patients with known metastatic disease vs. 
tissue samples from normal colon tissue. In these samples, the identified genes are 
underexpressed in the metastatic samples, as the ratio is less than one, preferably 0.5 or less, 
more preferably 0.25 or less. 

One of skill will recognize that although the sequences identified in Tables 1- 
26 exhibited increased or decreased expression in metastasizing colorectal cancer samples, 
the sequences of the invention, and their encoded proteins, can be used to diagnose, treat or 
prevent cancers in patients with Dukes stage A or B colorectal cancers. Alteration of gene 



7 



WO 02/068677 



PCT/US02/06001 



expression for a gene in Tables 1-26 may be more likely or less likely to indicate that the 
subject will progress to metastatic disease. The sequences can also be used to diagnose, treat 
or prevent precancerous or benign conditions such as precancerous colon adenomas. 
Alteration of gene expression for a gene in Tables 1-26 may or may not indicate that the 
subject is more likely to progress to cancer or to metastatic disease. Thus, although the 
specification focuses primarily on metastasizing colorectal cancer, the methods described 
below can also be applied to non- metastasizing colorectal cancers (e.g., Dukes stages A and 
B) and precancerous or benign conditions (eg., precancerous adenomas) as well. 

Definitions 

The term "metastatic colorectal cancer protein" or ''metastatic colorectal 
cancer polynucleotide" or "metastatic colorectal cancer-associated transcript" refers to 
nucleic acid and polypeptide polymorphic variants, alleles, mutants, and interspecies 
homologs that: (1) have a nucleotide sequence that has greater than about 60% nucleotide 
sequence identity, 65%, 70%, 75%, 80%, 85%, 90%, preferably 91%, 92%, 93%, 94%, 95%, 
96%, 97%, 98% or 99% or greater nucleotide sequence identity, preferably over a region of 
over a region of at least about 25, 50, 100, 200, 500, 1000, or more nucleotides, to a 
nucleotide sequence of or associated with a UniGene cluster of Tables 1-26; (2) bind to 
antibodies, e.g., polyclonal antibodies, raised against an immunogen comprising an amino 
acid sequence encoded by a nucleotide sequence of or associated with a UniGene cluster of 
Tables 1-26, and conservatively modified variants thereof, (3) specifically hybridize under 
stringent hybridization conditions to a nucleic acid sequence, or the complement thereof of 
Tables 1-26 and conservatively modified variants thereof or (4) have an amino acid sequence 
that has greater than about 60% amino acid sequence identity, 65%, 70%, 75%, 80%, 85%, 
90%, preferably 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% or greater amino 
sequence identity, preferably over a region of over a region of at least about 25, 50, 100, 200, 
500, 1000, or more amino acid, to an amino acid sequence encoded by a nucleotide sequence 
of or associated with a UniGene cluster of Tables 1-26. A polynucleotide or polypeptide 
sequence is typically from a mammal including, but not limited to, primate, e.g., human; 
rodent, e.g., rat, mouse, hamster; cow, pig, horse, sheep, or other mammal. A "metastatic 
colorectal cancer polypeptide" and a '^metastatic colorectal cancer polynucleotide," include 
both naturally occurring or recombinant. 
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A "full length" metastatic colorectal cancer protein or nucleic acid refers to a 
metastatic colorectal cancer polypeptide or polynucleotide sequence, or a variant thereof, that 
contains all of the elements normally contained in one or more naturally occurring, wild type 
metastatic colorectal cancer polynucleotide or polypeptide sequences. The "full length" may 
be prior to, or after, various stages of post-translation processing or splicing, including 
alternative splicing. 

biological sample" as used herein is a sample of biological tissue or fluid that 
contains nucleic acids or polypeptides, e.g., of a metastatic colorectal cancer protein, 
polynucleotide or transcript. Such samples include, but are not limited to, tissue isolated 
from primates, e.g., humans, or rodents, e.g., mice, and rats. Biological samples may also 
include sections of tissues such as biopsy and autopsy samples, frozen sections taken for 
histologic purposes, blood, plasma, serum, sputum, stool, tears, mucus, hair, skin, etc. 
Biological samples also include explants and primary and/or transformed cell cultures 
derived from patient tissues. A biological sample is typically obtained from a eukaryotic 
organism, most preferably a mammal such as a primate, e.g., chimpanzee or human; cow; 
dog; cat; a rodent, e.g., guinea pig, rat, mouse; rabbit; or other mammal; or a bird; reptile; 
fish. 

"Providing a biological sample" means to obtain a biological sample for use in 
methods described in this invention. Most often, this will be done by removing a sample of 
cells from an animal, but can also be accomplished by using previously isolated cells (e.g., 
isolated by another person, at another time, and/or for another purpose), or by performing the 
methods of the invention in vivo. Archival tissues, having treatment or outcome history, will 
be particularly useful. 

The terms "identical" or percent "identity," in the context of two or more 
nucleic acids or polypeptide sequences, refer to two or more sequences or subsequences that 
are the same or have a specified percentage of amino acid residues or nucleotides that are the 
same (i.e., about 60% identity, preferably 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 
95%, 96%, 97%, 98%, 99%, or higher identity over a specified region, when compared and 
aligned for maximum correspondence over a comparison window or designated region) as 
measured using a BLAST or BLAST 2.0 sequence comparison algorithms with default 
parameters described below, or by manual alignment and visual inspection (see, e.g., NCBI 
web site http://www.ncbi.nlm.nih.gov/BLAST/ or the like). Such sequences are then said to 
be "substantially identical." This definition also refers to, or may be applied to, the 
compliment of a test sequence. The definition also includes sequences that have deletions 
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and/or additions, as well as those that have substitutions, as well as naturally occurring, e.g., 
polymorphic or allelic variants, and man-made variants. As described below, the preferred 
algorithms can account for gaps and the like. Preferably, identity exists over a region that is 
at least about 25 amino acids or nucleotides in length, or more preferably over a region that is 
50-100 amino acids or nucleotides in length. 

For sequence comparison, typically one sequence acts as a reference sequence, 
to which test sequences are compared. When using a sequence comparison algorithm, test 
and reference sequences are entered into a computer, subsequence coordinates are designated, 
if necessary, and sequence algorithm program parameters are designated. Preferably, default 
program parameters can be used, or alternative parameters can be designated. The sequence 
comparison algorithm then calculates the percent sequence identities for the test sequences 
relative to the reference sequence, based on the program parameters. 

A "comparison window", as used herein, includes reference to a segment of 
one of the number of contiguous positions selected from the group consisting typically of 
from 20 to 600, usually about 50 to about 200, more usually about 100 to about 150 in which 
a sequence may be compared to a reference sequence of the same number of contiguous 
positions after the two sequences are optimally aligned. Methods of alignment of sequences 
for comparison are well-known in the art. Optimal alignment of sequences for comparison 
can be conducted, e.g., by the local homology algorithm of Smith & Waterman, Adv. Appl 
Math. 2:482 (1981), by the homology alignment algorithm of Needleman & Wunsch, J. Mol 
Biol 48:443 (1970), by the search for similarity method of Pearson & Lipman, Proc. Nat 7. 
Acad. Sci. USA 85:2444 (1988), by computerized implementations of these algorithms 
(GAP, BESTFTT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, 
Genetics Computer Group, 575 Science Dr., Madison, WI), or by manual alignment and 
visual inspection (see, e.g. t Current Protocols in Molecular Biology (Ausubel et al, eds. 
1995 supplement)). 

-Preferred examples of algorithms that are suitable for detemiining percent 
, sequence identity and sequence similarity include the BLAST and BLAST 2.0 algorithms, 
which are described in Altschul et aL, Nuc. Acids Res. 25:3389-3402 (1977) and Altschul et 
al, J. Mol Biol 215:403-410 (1990). BLAST and BLAST 2.0 are used, with the parameters 
described herein, to determine percent sequence identity for the nucleic acids and proteins of 
the invention. Software for performing BLAST analyses is publicly available through the 
National Center for Biotechnology information (http://wwwjicbi.nlm.nih.gov/). This 
algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying short 
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words of length W in the query sequence, which either match or satisfy some positive-valued 
threshold score T when aligned with a word of the same length in a database sequence. T is 
referred to as the neighborhood word score threshold (Altschul et al 9 supra). These initial 
neighborhood word hits act as seeds for initiating searches to find longer HSPs containing 
them. The word hits are extended in both directions along each sequence for as far as the 
cumulative alignment score can be increased. Cumulative scores are calculated using, e.g., 
for nucleotide sequences, the parameters M (reward score for a pair of matching residues; 
always > 0) and N (penalty score for mismatching residues; always < 0). For amino acid 
sequences, a scoring matrix is used to calculate the cumulative score. Extension of the word 
hits in each direction are halted when: the cumulative alignment score falls off by the 
quantity X from its maximum achieved value; the cumulative score goes to zero or below, 
due to the accumulation of one or more negative-scoring residue alignments; or the end of 
either sequence is reached. The BLAST algorithm parameters W, T, and X determine the 
sensitivity and speed of the alignment. The BLASTN program (for nucleotide sequences) 
uses as defaults a wordlength (W) of 1 1, an expectation (E) of 10, M=5, N=-4 and a 
comparison of both strands. For amino acid sequences, the BLASTP program uses as 
defaults a wordlength of 3, and expectation (E) of 10, and the BLOSUM62 scoring matrix 
(see Henikoff & Henikoff, Proc. Natl Acad. Sci. USA 89:10915 (1989)) alignments (B) of 
50, expectation (E) of 10, M=5, N=-4, and a comparison of both strands. 

The BLAST algorithm also performs a statistical analysis of the similarity 
between two sequences (see, e.g., Karlin & Altschul, Proc. Natl Acad. Sci. USA 90:5873- 
5787 (1993)). One measure of similarity provided by the BLAST algorithm is the smallest 
sum probability (P(N)), which provides an indication of the probability by which a match 
between two nucleotide or amino acid sequences would occur by chance. For example, a 
nucleic acid is considered similar to a reference sequence if the smallest sum probability in a 
comparison of the test nucleic acid to the reference nucleic acid is less than about 0.2, more 
preferably less than about 0.01, and most preferably less than about 0.001 . Log values may 
be large negative numbers, e.g., 5, 10, 20, 30, 40, 40, 70, 90, 110, 150, 170, etc. 

An indication that two nucleic acid sequences or polypeptides are substantially 
identical is that the polypeptide encoded by the first nucleic acid is immunologically cross 
reactive with the antibodies raised against the polypeptide encoded by the second nucleic 
acid, as described below. Thus, a polypeptide is typically substantially identical to a second 
polypeptide, e.g., where the two peptides differ only by conservative substitutions. Another 
indication that two nucleic acid sequences are substantially identical is that the two molecules 
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or their complements hybridize to each other under stringent conditions, as described below. 
Yet another indication that two nucleic acid sequences are substantially identical is that the 
same primers can be used to amplify the sequences. 

A "host cell" is a naturally occurring cell or a transformed cell that contains an 
expression vector and supports the replication or expression of the expression vector. Host 
cells may be cultured cells, explants, cells in vivo, and the like. Host cells may be 
prokaryotic cells such as E. colU or eukaryotic cells such as yeast, insect, amphibian, or 
mammalian cells such as CHO, HeLa, and the like {see, e.g., the American Type Culture 
Collection catalog or web site, www.atcc.org). 

The terms "isolated," purified," or "biologically pure" refer to material that is 
substantially or essentially free from components that normally accompany it as found in its 
native state. Purity and homogeneity are typically determined using analytical chemistry 
techniques such as polyaciylamide gel electrophoresis or high performance liquid 
chromatography. A protein or nucleic acid that is the predominant species present in a 
preparation is substantially purified. In particular, an isolated nucleic acid is separated from 
some open reading frames that naturally flank the gene and encode proteins other than protein 
encoded by the gene. The term '^purified" in some embodiments denotes that a nucleic acid 
or protein gives rise to essentially one band in an electrophoretic gel. Preferably, it means 
that the nucleic acid or protein is at least 85% pure, more preferably at least 95% pure, and 
most preferably at least 99% pure. "Purify" or ''purification" in other embodiments means 
removing at least one contaminant from the composition to be purified. In this sense, 
purification does not require that the purified compound be homogenous, e.g., 100% pure. 

The terms "polypeptide," "peptide" and "protein" are used interchangeably 
herein to refer to a polymer of amino acid residues. The terms apply to amino acid polymers 
in which one or more amino acid residue is an artificial chemical mimetic of a corresponding 
naturally occurring amino acid, as well as to naturally occurring amino acid polymers, those 
containing modified residues, and non-naturally occurring amino acid polymer. 

The term "amino acid" refers to naturally occurring and synthetic amino acids, 
as well as amino acid analogs and amino acid mimetics that function similarly to the naturally 
occurring amino acids. Naturally occurring amino acids are those encoded by the genetic 
code, as well as those amino acids that are later modified, e.g., hydroxyproline, y- 
carboxyglutamate, and O-phosphoserine. Amino acid analogs refers to compounds that have 
the same basic chemical structure as a naturally occurring amino acid, e.g., an a carbon that is 
bound to a hydrogen, a carboxyl group, an amino group, and an R group, e.g., homoserine, 



12 



WO 02/068677 



PCTYUS02/06001 



norleucine, methionine sulfoxide, methionine methyl sulfonium. Such analogs may have 
modified R groups (e.g., norleucine) or modified peptide backbones, but retain the same basic 
chemical structure as a naturally occurring amino acid. Amino acid mimetics refers to 
chemical compounds that have a structure that is different from the general chemical 
structure of an amino acid, but that functions similarly to a naturally occurring amino acid. 

Amino acids may be referred to herein by either their commonly known three 
letter symbols or by the one-letter symbols recommended by the IUPAC-IUB Biochemical 
Nomenclature Commission. Nucleotides, likewise, may be referred to by their commonly 
accepted single-letter codes. 

"Conservatively modified variants" applies to both amino acid and nucleic 
acid sequences. With respect to particular nucleic acid sequences, conservatively modified 
variants refers to those nucleic acids which encode identical or essentially identical amino 
acid sequences, or where the nucleic acid does not encode an amino acid sequence, to 
essentially identical or associated, e.g., naturally contiguous, sequences. Because of the 
degeneracy of the genetic code, a large number of functionally identical nucleic acids encode 
most proteins. For instance, the codons GCA, GCC, GCG and GCU all encode the amino 
acid alanine. Thus, at every position where an alanine is specified by a codon, the codon can 
be altered to another of the corresponding codons described without altering the encoded 
polypeptide. Such nucleic acid variations are "silent variations," which are one species of 
conservatively modified variations. Every nucleic acid sequence herein which encodes a 
polypeptide also describes silent variations of the nucleic acid. One of skill will recognize 
that in certain contexts each codon in a nucleic acid (except AUG, which is ordinarily the 
only codon for methionine, and TGG, which is ordinarily the only codon for tryptophan) can 
be modified to yield a functionally identical molecule. Accordingly, often silent variations of 
a nucleic acid which encodes a polypeptide is implicit in a described sequence with respect to 
the expression product, but not with respect to actual probe sequences. 

As to amino acid sequences, one of skill will recognize that individual 
substitutions, deletions or additions to a nucleic acid, peptide, polypeptide, or protein 
sequence which alters, adds or deletes a single amino acid or a small percentage of amino 
acids in the encoded sequence is a "conservatively modified variant" where the alteration 
results in the substitution of an amino acid with a chemically similar amino acid. 
Conservative substitution tables providing functionally similar amino acids are well known in 
the art Such conservatively modified variants are in addition to and do not exclude 
polymorphic variants, interspecies homologs, and alleles of the invention. 
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The following eight groups each contain amino acids that are typically 
conservative substitutions for one another: 1) Alanine (A), Glycine (G); 2) Aspartic acid (D), 
Glutamic acid (E); 3) Asparagine (N), Glutamine (Q); 4) Arginine (R), Lysine (K); 5) 
Isoleucine (I), Leucine (L), Methionine (M), Valine (V); 6) Phenylalanine (F), Tyrosine (Y), 
Tryptophan (W); 7) Serine (S), Threonine (T); and 8) Cysteine (C), Methionine (M) (see, 
e.g, Creighton, Proteins (1984)). 

Macromolecular structures such as polypeptide structures can be described in 
terms of various levels of organization. For a general discussion of this organization, see, 
e.g, Alberts et al, Molecular Biology of the Cell (3 rd ed., 1994) and Cantor & Schimmel, 
Biophysical Chemistry Part I: The Conformation of Biological Macromolecules (1980). 
"Primary structure" refers to the amino acid sequence of a particular peptide. "Secondary 
structure" refers to locally ordered, three dimensional structures within a polypeptide. These 
structures are commonly known as domains. Domains are portions of a polypeptide that 
often form a compact unit of the polypeptide and are typically 25 to approximately 500 
amino acids long. Typical domains are made up of sections of lesser organization such as 
stretches of |J-sheet and a-helices. "Tertiary structure" refers to the complete three 
dimensional structure of a polypeptide monomer. "Quaternary structure" refers to the three 
dimensional structure formed, usually by the noncovalent association of independent tertiary 
units. Anisotropic terms are also known as energy terms. 

"Nucleic acid" or "oligonucleotide" or "polynucleotide" or grammatical 
equivalents used herein means at least two nucleotides covalently linked together. 
Oligonucleotides are typically from about 5, 6, 7, 8, 9, 10, 12, 15, 25, 30, 40, 50 or more 
nucleotides in length, up to about 100 nucleotides in length. Nucleic acids and 
polynucleotides are a polymers of any length, including longer lengths, e.g., 200, 300, 500, 
1000, 2000, 3000, 5000, 7000, 1 0,000, etc. A nucleic acid of the present invention will 
generally contain phosphodiester bonds, although in some cases, nucleic acid analogs are 
included that may have alternate backbones, comprising, e.g., phosphoramidate, 
phosphorothioate, pbosphorodithioate, or O-methylphophoroamidite linkages (see Eckstein, 
Oligonucleotides and Analogues: A Practical Approach, Oxford University Press); and 
peptide nucleic acid backbones and linkages. Other analog nucleic acids include those with 
positive backbones; non-ionic backbones, and non-ribose backbones, including those 
described in U.S. Patent Nos. 5,235,033 and 5,034,506, and Chapters 6 and 7, ASC 
Symposium Series 580, Carbohydrate Modifications in Antisense Research, Sanghui & 
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Cook, eds.. Nucleic acids containing one or more caibocyclic sugars are also included within 
one definition of nucleic acids. Modifications of the ribose-phosphate backbone may be done 
for a variety of reasons, e.g. to increase the stability and half-life of such molecules in 
physiological environments or as probes on a biochip. Mixtures of naturally occurring 
nucleic acids and analogs can be made; alternatively, mixtures of different nucleic acid 
analogs, and mixtures of naturally occurring nucleic acids and analogs may be made. 

Particularly preferred are peptide nucleic acids (PNA) which includes peptide 
nucleic acid analogs. These backbones are substantially non-ionic under neutral conditions, in 
contrast to the highly charged phosphodiester backbone of naturally occurring nucleic acids. 
This results in two advantages. First, the PNA backbone exhibits improved hybridization 
kinetics. PNAs have larger changes in the melting temperature (T^) for mismatched versus 
perfectly matched basepairs. DNA and RNA typically exhibit a 2-4°C drop in T m for an 
internal mismatch. With the non-ionic PNA backbone, the drop is closer to 7-9°C. Similarly, 
due to their non-ionic nature, hybridization of the bases attached to these backbones is 
relatively insensitive to salt concentration. In addition, PNAs are not degraded by cellular 
enzymes, and thus can be more stable. 

The nucleic acids may be single stranded or double stranded, as specified, or 
contain portions of both double stranded or single stranded sequence. As will be appreciated 
by those in the art, the depiction of a single strand also defines the sequence of the 
complementary strand; thus the sequences described herein also provide the complement of 
the sequence. The nucleic acid may be DNA, both genomic and cDNA, RNA or a hybrid, 
where the nucleic acid may contain combinations of deoxyribo- and ribo-nucleotides, and 
combinations of bases, including uracil, adenine, thymine, cytosine, guanine, inosine, 
xanthine hypoxanthine, isocytosine, isoguanine, etc. "Transcript" typically refers to a 
naturally occurring RNA, e.g., a pre-mRNA, hnRNA, or mRNA. As used herein, the term 
"nucleoside" includes nucleotides and nucleoside and nucleotide analogs, and modified 
nucleosides such as amino modified nucleosides. In addition, "nucleoside" includes non- 
naturally occurring analog structures. Thus, e.g. the individual units of a peptide nucleic 
acid, each containing a base, are referred to herein as a nucleoside. 

A "label" or a "detectable moiety" is a composition detectable by 
spectroscopic, photochemical, biochemical, immunochemical, chemical, or other physical 
means. For example, useful labels include 32 P, fluorescent dyes, electron-dense reagents, 
enzymes (e.g., as commonly used in an ELISA), biotin, digoxigenin, or haptens and proteins 
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or other entities which can be made detectable, e.g., by incorporating a radiolabel into the 
peptide or used to detect antibodies specifically reactive with the peptide. 

An "effector" or "effector moiety" or "effector component" is a molecule that 
is bound (or linked, or conjugated), either covalently, through a linker or a chemical bond, or 
noncovalently, through ionic, van der Waals, electrostatic, or hydrogen bonds, to an antibody. 
The "effector" can be a variety of molecules including, e.g., detection moieties including 
radioactive compounds, fluorescent compounds, an enzyme or substrate, tags such as epitope 
tags, a toxin; activatable moieties, a chemotherapeutic agent; a lipase; an antibiotic; or a 
radioisotope emitting "hard" e.g., beta radiation. 

A "labeled nucleic acid probe or oligonucleotide" is one that is bound, either 
covalently, through a linker or a chemical bond, or noncovalently, through ionic, van der 
Waals, electrostatic, or hydrogen bonds to a label such that the presence of the probe may be 
detected by detecting the presence of the label bound to the probe. Alternatively, method 
using high affinity interactions may achieve the same results where one of a pair of binding 
partners binds to the other, e.g., biotin, streptavidin. 

As used herein a "nucleic acid probe or oligonucleotide" is defined as a 
nucleic acid capable of binding to a target nucleic acid of complementary sequence through 
one or more types of chemical bonds, usually through complementary base pairing, usually 
through hydrogen bond formation. As used herein, a probe may include natural (i.e., A, G, C, 
or T) or modified bases (7-deazaguanosine, inosine, etc.). In addition, the bases in a probe 
may be joined by a linkage other than a phosphodiester bond, so long as it does not 
functionally interfere with hybridization. Thus, e.g., probes may be peptide nucleic acids in 
which the constituent bases are joined by peptide bonds rather than phosphodiester linkages. 
It will be understood by one of skill in the art that probes may bind target sequences lacking 
complete complementarity with the probe sequence depending upon the stringency of the 
hybridization conditions. The probes are preferably directly labeled as with isotopes, 
chromophores, lumiphores, chromogens, or indirectly labeled such as with biotin to which a 
streptavidin complex may later bind By assaying for the presence or absence of the probe, 
one can detect the presence or absence of the select sequence or subsequence. Diagnosis or 
prognosis may be based at the genomic level, or at the level of RNA or protein expression. 

The term "recombinant" when used with reference, e.g., to a cell, or nucleic 
acid, protein, or vector I indicates that the cell, nucleic acid, protein or vector, has been 
modified by the introduction of a heterologous nucleic acid or protein or the alteration of a 
native nucleic acid or protein, or that the cell is derived ftom a cell so modified. Thus, e.g., 
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recombinant cells express genes that are not found within the native (non-recombinant) form 
of the cell or express native genes that are otherwise abnormally expressed, under expressed 
or not expressed at all. By the tenn Recombinant nucleic acid" herein is meant nucleic acid, 
originally formed in vitro, in general, by the manipulation of nucleic acid, e.g., using 
polymerases and endonucleases, in a form not normally found in nature. In this manner, 
operably linkage of different sequences is achieved. Thus an isolated nucleic acid, in a linear 
form, or an expression vector formed in vitro by ligating DNA molecules that are not 
normally joined, are both considered recombinant for the purposes of this invention. It is 
understood that once a recombinant nucleic acid is made and reintroduced into a host cell or 
organism, it will replicate non-recombinantly, i.e., using the in vivo cellular machinery of the 
host cell rather than in vitro manipulations; however, such nucleic acids, once produced 
recombinantly, although subsequently replicated non-recombinantly, are still considered 
recombinant for the purposes of the invention. Similarly, a Recombinant protein" is a protein 
made using recombinant techniques, i.e., through the expression of a recombinant nucleic 
acid as depicted above. 

The term "heterologous" when used with reference to portions of a nucleic 
acid indicates that the nucleic acid comprises two or more subsequences that are not normally 
found in the same relationship to each other in nature. For instance, the nucleic acid is 
typically recombinantly produced, having two or more sequences, e.g., from unrelated genes 
arranged to make a new functional nucleic acid, e.g., a promoter from one source and a 
coding region from another source. Similarly, a heterologous protein will often refer to two 
or more subsequences that are not found in the same relationship to each other in nature (e.g., 
a fusion protein). 

A "promoter" is defined as an array of nucleic acid control sequences that 
direct transcription of a nucleic acid. As used herein, a promoter includes necessary nucleic 
acid sequences near the start site of transcription, such as, in the case of a polymerase II type 
promoter, a TATA element. A promoter also optionally includes distal enhancer or repressor 
elements, which can be located as much as several thousand base pairs from the start site of 
transcription. A "constitutive" promoter is a promoter that is active under most 
environmental and developmental conditions. An "inducible" promoter is a promoter that is 
active under environmental or developmental regulation. The term "operably linked" refers 
to a functional linkage between a nucleic acid expression control sequence (such as a 
promoter, or array of transcription factor binding sites) and a second nucleic acid sequence, 
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wherein the expression control sequence directs transcription of the nucleic acid 
corresponding to the second sequence. 

An "expression vector" is a nucleic acid construct, generated recombinantly or 
synthetically, with a series of specified nucleic acid elements that permit transcription of a 
particular nucleic acid in a host cell The expression vector can be part of a plasmid, virus, or 
nucleic acid fragment. Typically, the expression vector includes a nucleic acid to be 
transcribed operably linked to a promoter. 

The phrase "selectively (or specifically) hybridizes to" refers to the binding, 
duplexing, or hybridizing of a molecule only to a particular nucleotide sequence under 
stringent hybridization conditions when that sequence is present in a complex mixture (e.g., 
total cellular or library DNA or RNA). 

The phrase "stringent hybridization conditions" refers to conditions under 
which a probe will hybridize to its target subsequence, typically in a complex mixture of 
nucleic acids, but to essentially no other sequences. Stringent conditions are sequence- 
dependent and will be different in different circumstances. Longer sequences hybridize 
specifically at higher temperatures. An extensive guide to the hybridization of nucleic acids 
is found in Tijssen, Techniques in Biochemistry and Molecular Biology— Hybridization with 
Nucleic Probes, "Overview of principles of hybridization and the strategy of nucleic acid 
assays" (1993). Generally, stringent conditions are selected to be about 5-10°C lower than 
the thermal melting point (Tm) for the specific sequence at a defined ionic strength pH. The 
T m is the temperature (under defined ionic strength, pH, and nucleic concentration) at which 
50% of the probes complementary to the target hybridize to the target sequence at 
equilibrium (as the target sequences are present in excess, at Tm, 50% of the probes are 
occupied at equilibrium). Stringent conditions will be those in which the salt concentration is 
less than about 1 .0 M sodium ion, typically about 0.01 to 1.0 M sodium ion concentration (or 
other salts) at pH 7.0 to 8.3 and the temperature is at least about 30°C for short probes (e.g., 
10 to 50 nucleotides) and at least about 60°C for long probes (e.g., greater than 50 
nucleotides). Stringent conditions may also be achieved with the addition of destabilizing 
agents such as fonnamide. For selective or specific hybridization, a positive signal is at least 
two times background, preferably 10 times background hybridization. Exemplary stringent 
hybridization conditions are often: 50% fonnamide, 5x SSC, and 1% SDS, incubating at 
42°C, or, 5x SSC, 1% SDS, incubating at 65°C, with wash in 0.2x SSC, and 0.1% SDS at 
65°C. For PCR, a temperature of about 36°C is typical for low stringency amplification, 
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although annealing temperatures may vary between about 32°C and 48°C depending on 
primer length. For high stringency PCR amplification, a temperature of about 62°C is 
typical, although high stringency annealing temperatures can range from about 50°C to about 
65°C, depending on the primer length and specificity. Typical cycle conditions for both high 
and low stringency amplifications include a denaturation phase of 90°C - 95°C for 30 sec - 2 
min., an annealing phase lasting 30 sec. - 2 min., and an extension phase of about 72°C for 1 - 
2 min. Protocols and guidelines for low and high stringency amplification reactions are 
provided, e.g., in Innis et al. 9 PCR Protocols, A Guide to Methods and Applications (1990). 

Nucleic acids that do not hybridize to each other under stringent conditions are 
still substantially identical if the polypeptides which they encode are substantially identical. 
This occurs, e.g., when a copy of a nucleic acid is created using the maximum codon 
, degeneracy permitted by the genetic code. In such cases, the nucleic acids typically hybridize 
under moderately stringent hybridization conditions. Exemplary "moderately stringent 
hybridization conditions" include a hybridization in a buffer of 40% formamide, 1 M NaCl, 
1% SDS at 37°C, and a wash in IX SSC at 45°C. A positive hybridization is at least twice 
background. Those of ordinary skill will readily recognize that alternative hybridization and 
wash conditions can be utilized to provide conditions of similar stringency. Additional 
guidelines for determining hybridization parameters are provided in numerous reference, e.g., 
and Current Protocols in Molecular Biology, ed. Ausubel, et al. 

The phrase "functional effects" in the context of assays for testing compounds 
that modulate activity of a metastatic colorectal cancer protein includes the determination of a 
parameter that is indirectly or directly undo: the influence of the metastatic colorectal cancer 
protein or nucleic acid, e.g., an enzymatic, functional, physical, or chemical effect, such as 
the ability to decrease metastatic colorectal cancer. It includes ligand binding activity; cell 
growth on soft agar; anchorage dependence; contact inhibition and density limitation of 
growth; cellular proliferation; cellular transformation; growth factor or serum dependence; 
tumor specific marker levels; invasiveness into Matrigel; tumor growth and metastasis in 
vivo; mRNA and protein expression in cells undergoing metastasis, and other characteristics 
of metastatic colorectal cancer cells, functional effects" include in vitro, in vivo, and ex 
vivo activities. 

By "determining the functional effect" is meant assaying for a compound that 
increases or decreases a parameter that is indirectly or directly under the influence of a 
metastatic colorectal cancer protein sequence, e.g., functional, enzymatic, physical and 
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chemical effects. Such functional effects can be measured by any means known to those 
skilled in the art, e.g., changes in spectroscopic characteristics (e.g., fluorescence, 
absorbance, refractive index), hydrodynamic (e.g., shape), chromatographic, or solubility 
properties for the protein, measuring inducible markers or transcriptional activation of the 
metastatic colorectal cancer protein; measuring binding activity or binding assays, e.g., 
binding to antibodies or other ligands, and measuring cellular proliferation. Determination of 
the functional effect of a compound on metastatic colorectal cancer can also be performed 
using metastatic colorectal cancer assays known to those of skill in the art such as an in vitro 
assays, e.g., cell growth on soft agar, anchorage dependence; contact inhibition and density 
limitation of growth; cellular proliferation; cellular transformation; growth factor or serum 
dependence; tumor specific marker levels; invasiveness into Matrigel; tumor growth and 
metastasis in vivo; mRNA and protein expression in cells undergoing metastasis, and other 
characteristics of metastatic colorectal cancer cells. The functional effects can be evaluated 
by many means known to those skilled in the art, e.g., microscopy for quantitative or 
qualitative measures of alterations in morphological features, measurement of changes in 
RNA or protein levels for metastatic colorectal cancer-associated sequences, measurement of 
RNA stability, identification of downstream or reporter gene expression (CAT, luciferase, (J- 
gal, GFP and the like), e.g., via chemiluminescence, fluorescence, colorimetric reactions, 
antibody binding, inducible markers, and ligand binding assays. 

'Inhibitors", "activators", and "modulators" of metastatic colorectal cancer 
polynucleotide and polypeptide sequences are used to refer to activating, inhibitory, or 
modulating molecules or compounds identified using in vitro and in vivo assays of metastatic 
colorectal cancer polynucleotide and polypeptide sequences of the invention. Inhibitors are 
compounds that, e.g., bind to, partially or totally block activity, decrease, prevent, delay 
activation, inactivate, desensitize, or down regulate the activity or expression of metastatic 
colorectal cancer proteins of the invention, e.g., antagonists. Antisense nucleic acids may 
seem to inhibit expression and subsequent function of the protein. "Activators" are 
compounds that increase, open, activate, facilitate, enhance activation, sensitize, agonize, or 
up regulate metastatic colorectal cancer protein activity. Inhibitors, activators, or modulators 
also include genetically modified versions of metastatic colorectal cancer proteins, e.g., 
versions with altered activity, as well as naturally occurring and synthetic ligands, 
antagonists, agonists, antibodies, small chemical molecules and the like. Such assays for 
inhibitors and activators include, e.g., expressing the metastatic colorectal cancer protein in 
vitro, in cells, or cell membranes, applying putative modulator compounds, and then 
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detennining the functional effects on activity, as described above. Activators and inhibitors 
of metastatic colorectal cancer can also be identified by incubating metastatic colorectal 
cancer cells with the test compound and determining increases or decreases in the expression 
of 1 or more metastatic colorectal cancer proteins, e.g., 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 40, 50 
or more metastatic colorectal cancer proteins, such as colorectal cancer proteins encoded by 
the sequences set out in Tables 1-26. 

Samples or assays comprising metastatic colorectal cancer proteins that are 
treated with a potential activator, inhibitor, or modulator are compared to control samples 
without the inhibitor, activator, or modulator to examine the extent of inhibition. Control 
samples (untreated with inhibitors) are assigned a relative protein activity value of 100%. 
Inhibition of a polypeptide is achieved when the activity value relative to the control is about 
80%, preferably 50%, more preferably 25-0%. Activation of a metastatic colorectal cancer 
polypeptide is achieved when the activity value relative to the control (untreated with 
activators) is 1 10%, more preferably 150%, more preferably 200-500% (i.e., two to five fold 
higher relative to the control), more preferably 1000-3000% higher. 

The phrase "changes in cell growth" refers to any change in cell growth and 
proliferation characteristics in vitro or in vivo, such as formation of foci, anchorage 
independence, semi-solid or soft agar growth, changes in contact inhibition and density 
limitation of growth, loss of growth factor or serum requirements, changes in cell 
morphology, gaining or losing immortalization, gaining or losing tumor specific markers, 
ability to form or suppress tumors when injected into suitable animal hosts, and/or 
immortalization of the cell. See, e.g., Freshney, Culture of Animal Cells a Manual of Basic 
Technique pp. 231-241 (3 rd ed 1994). 

'Tumor cell" refers to precancerous, cancerous, and normal cells in a tumor. 

"Cancer cells," "transformed" cells or "transformation" in tissue culture, refers 
to spontaneous or induced phenotypic changes that do not necessarily involve the uptake of 
new genetic material. Although transformation can arise from infection with a transforming 
virus and incorporation of new genomic DNA, or uptake of exogenous DNA, it can also arise 
spontaneously or following exposure to a carcinogen, thereby mutating an endogenous gene. 
Transformation is associated with phenotypic changes, such as immortalization of cells, 
aberrant growth control, nonmorphological changes, and/or malignancy {see, Freshney, 
Culture of Animal Cells a Manual of Basic Technique (3 rd ed. 1994)). 

"Antibody" refers to a polypeptide comprising a framework region from an 
immunoglobulin gene or fragments thereof that specifically binds and recognizes an antigen. 
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The recognized immunoglobulin genes include the kappa, lambda, alpha, gamma, delta, 
epsilon, and mu constant region genes, as well as the myriad immunoglobulin variable.region 
genes. Light chains are classified as either kappa or lambda Heavy chains are classified as 
gamma, mu, alpha, delta, or epsilon, which in turn define the immunoglobulin classes, IgG, 
IgM, IgA, IgD and IgE, respectively. Typically, the antigen-binding region of an antibody or 
its functional equivalent will be most critical in specificity and affinity of binding. See Paul, 
Fundamental Immunology, 

An exemplary immunoglobulin (antibody) structural unit comprises a 
tetramer. Each tetramer is composed of two identical pairs of polypeptide chains, each pair 
having one "light" (about 25 kD) and one "heavy" chain (about 50-70 kD), The N-terminus 
of each chain defines a variable region of about 100 to 1 10 or more amino acids primarily 
responsible for antigen recognition. The terms variable light chain (V L ) and variable heavy 
chain (V H ) refer to these light and heavy chains respectively. 

Antibodies exist, e.g., as intact immunoglobulins or as a number of well- 
characterized fragments produced by digestion with various peptidases. Thus, e.g., pepsin 
digests an antibody below the disulfide linkages in the hinge region to produce V(ab% a 
dimer of Fab which itself is a light chain joined to V h -Ch1 by a disulfide bond. The F(ab')2 
may be reduced under mild conditions to break the disulfide linkage in the hinge region, 
thereby converting the F(ab r )' 2 dimer into an Fab' monomer. The Fab' monomer is 
essentially Fab with part of the hinge region {see Fundamental Immunology (Paul ed., 3d ed. 
1993). While various antibody fragments are defined in terms of the digestion of an intact 
antibody, one of skill will appreciate that such fragments may be synthesized de novo either 
chemically or by using recombinant DNA methodology. Thus, the term antibody, as used 
herein, also includes antibody fragments either produced by the modification of whole 
antibodies, or those synthesized de novo using recombinant DNA methodologies (e.g., single 
chain Fv) or those identified using phage display libraries {see, e.g, McCafferty et al, Nature 
348:552-554(1990)) 

For preparation of antibodies, e.g., recombinant, monoclonal, or polyclonal 
antibodies, many technique known in the art can be used {see, e.g., Kohler & Milstein, 
Nature 256:495-497 (1975); Kozbor et a!., Immunology Today 4:72 (1983); Cole et al 9 pp. 
77-96 in Monoclonal Antibodies and Cancer Therapy (1985); Coligan, Current Protocols in 
Immunology (1991); Harlow & Lane, Antibodies, A Laboratory Manual (1988); and Goding, 
Monoclonal Antibodies: Principles and Practice (2d ed. 1986)). Techniques for the 
production of single chain antibodies (U.S. Patent 4,946,778) can be adapted to produce 
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antibodies to polypeptides of this invention. Also, transgenic mice, or other organisms such 
as other mammals, may be used to express humanized antibodies. Alternatively, phage 
display technology can be used to identify antibodies and heteromeric Fab fragments that 
specifically bind to selected antigens (see, e.g. t McCafferty et al, Nature 348:552-554 
(1990); Marks et al, Biotechnology 10:779-783 (1992)). 

A "chimeric antibody" is an antibody molecule in which, e.g, (a) the constant 
region, or a portion thereof is altered, replaced or exchanged so that the antigen binding site 
(variable region) is linked to a constant region of a different or altered class, effector function 
and/or species, or an entirely different molecule which confers new properties to the chimeric 
antibody, e.g., an enzyme, toxin, hormone, growth factor, drug, etc.; or (b) the variable 
region, or a portion thereof; is altered, replaced or exchanged with a variable region having a 
different or altered antigen specificity. 

Identification of metastatic colorectal cancer-associated sequences 

In one aspect, the expression levels of genes are determined in different 
patient samples for which diagnosis information is desired, to provide expression profiles. 
An expression profile of a particular sample is essentially a "fingerprint" of the state of the 
sample; while two states may have any particular gene similarly expressed, the evaluation of 
a number of genes simultaneously allows the generation of a gene expression profile that is 
characteristic of the state of the cell. That is, normal tissue may be distinguished from 
cancerous or metastatic cancerous tissue, or metastatic cancerous tissue can be compared 
with tissue from surviving cancer patients. By comparing expression profiles of tissue in 
known different metastatic colorectal cancer states, information regarding which genes are 
important (including both up- and down-regulation of genes) in each of these states is 
obtained. 

The identification of sequences that are differentially expressed in metastatic 
colorectal cancer versus non-metastatic colorectal cancer tissue allows the use of this 
information in a number of ways. For example, a particular treatment regime may be 
evaluated: does a chemotherapeutic drug act to down-regulate metastatic colorectal cancer, 
and thus tumor growth or recurrence, in a particular patient. Similarly, diagnosis and 
treatment outcomes may be done or confirmed by comparing patient samples with the known 
expression profiles. Metastatic tissue can also be analyzed to determine the stage of 
metastatic colorectal cancer in the tissue. Furthermore, these gene expression profiles (or 
individual genes) allow screening of drug candidates with an eye to mimicking or altering a 
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particular expression profile; e.g., screening can be done for drugs that suppress the 
metastatic colorectal cancer expression profile. This may be done by making biochips 
comprising sets of the important metastatic colorectal cancer genes, which can then be used 
in these screens. PCR methods may be applied with selected primer pairs, and analysis may 
be of RNA or of genomic sequences. These methods can also be done on the protein basis; 
that is, protein expression levels of the metastatic colorectal cancer proteins can be evaluated 
for diagnostic purposes or to screen candidate agents. In addition, the metastatic colorectal 
cancer nucleic acid sequences can be administered for gene therapy purposes, including the 
administration of antisense nucleic acids, or the metastatic colorectal cancer proteins 
(including antibodies and other modulators thereof) administered as therapeutic drugs or as 
protein or DNA vaccines. 

Thus the present invention provides nucleic acid and protein sequences that 
are differentially expressed in metastatic colorectal cancer, herein termed "metastatic 
colorectal cancer sequences." As outlined below, metastatic colorectal cancer sequences 
include those that are up-regulated (i.e., expressed at a higher level) in metastatic colorectal 
cancer, as well as those that are down-regulated (i.e., expressed at a lower level). In a 
preferred embodiment, the metastatic colorectal cancer sequences are from humans; however, 
as will be appreciated by those in the art, metastatic colorectal cancer sequences from other 
organisms may be useful in animal models of disease and drug evaluation; thus, other 
metastatic colorectal cancer sequences are provided, from vertebrates, including mammals, 
including rodents (rats, mice, hamsters, guinea pigs, etc.), primates, farm animals (including 
sheep, goats, pigs, cows, horses, etc.) and pets (dogs, cats, etc.). Metastatic colorectal cancer 
sequences from other organisms may be obtained using the techniques outlined below. 

Metastatic colorectal cancer sequences can include both nucleic acid and 
amino acid sequences. As will be appreciated by those in the art and is more fully outlined 
below, metastatic colorectal cancer nucleic acid sequences are useful in a variety of 
applications, including diagnostic applications, which will detect naturally occurring nucleic 
acids, as well as screening applications; e.g., biochips comprising nucleic acid probes or PCR 
microtiter plates with selected probes to the metastatic colorectal cancer sequences can be 
generated. 

A metastatic colorectal cancer sequence can be initially identified by 
substantial nucleic acid and/or amino acid sequence homology to the metastatic colorectal 
cancer sequences outlined herein. Such homology can be based upon the overall nucleic acid 
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or amino acid sequence, and is generally determined as outlined below, using either 
homology programs or hybridization conditions. 

For identifying metastatic colorectal cancer-associated sequences, the 
metastatic colorectal cancer screen typically includes comparing genes identified in different 
tissues, e.g., normal and cancerous tissues, or tumor tissue samples from patients who have 
metastatic disease vs. non metastatic tissue, or tumor tissue samples from patients who have 
been diagnosed with Dukes stage A or B cancer but have survived vs. metastatic tissue. 
Other suitable tissue comparisons include comparing metastatic colorectal cancer samples 
with metastatic cancer samples from other cancers, such as lung, breast, other gastrointestinal 
cancers, prostate, ovarian, etc. Samples of, e.g., Dukes stage B survivor tissue and tissue 
undergoing metastasis are applied to biochips comprising nucleic acid probes. The samples 
are first microdissected, if applicable, and treated as is known in the art for the preparation of 
mRNA. Suitable biochips are commercially available, e.g., from Affymetrix. Gene 
expression profiles as described herein are generated and the data analyzed. 

In one embodiment, the genes showing changes in expression as between 
normal and disease states are compared to genes expressed in other normal tissues, preferably 
normal colon, but also including, and not limited to lung, heart, brain, liver, breast, kidney, 
muscle, prostate, small intestine, large intestine, spleen, bone and placenta. In a preferred 
embodiment, those genes identified during the metastatic colorectal cancer screen that are 
expressed in significant amounts in other tissues are removed from the profile, although in 
some embodiments, this is not necessary. That is, when screening for drugs, it is usually 
preferable that the target be disease specific, to minimize possible side effects. 

In a preferred embodiment, metastatic colorectal cancer sequences are those 
that are up-regulated in metastatic colorectal cancer; that is, the expression of these genes is 
higher in the metastatic tissue as compared to non-metastatic cancerous tissue or normal 
colon tissue (see, e.g„ Tables 1-26). 'Hp-regulation" as used herein means, when the ratio is 
presented as a number greater than one, that the ratio is greater than one, preferably 1.5 or 
greater, more preferably 2.0 or greater. All UniGene cluster identification numbers and 
accession numbers herein are for the GenBank sequence database and the sequences of the 
accession numbers are hereby expressly incorporated by reference. GenBank is known in the 
art, see, e.g. t Benson, DA, et a/., Nucleic Acids Research 26:1-7 (1998) and 
http://wwwjicbi.nlm.mh.gov/. Sequences are also available in other databases, e.g., 
European Molecular Biology Laboratory (EMBL) and DNA Database of Japan (DDBJ). 
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In another preferred embodiment, metastatic colorectal cancer sequences are 
those that are down-regulated in the metastatic colorectal cancer; that is, the expression of 
these genes is lower in metastatic tissue as compared to non-metastatic cancerous tissue or 
normal colon tissue (see, e.g., Tables 1-26). <fi Down-regulation" as used herein means, when 
the ratio is presented as a number greater than one, that the ratio is greater than one, 
preferably 1.5 or greater, more preferably 2.0 or greater, or, when the ratio is presented as a 
number less than one, that the ratio is less than one, preferably 0.5 or less, more preferably 
0.25 or less. 

Informatics 

The ability to identify genes that are over or under expressed in metastatic 
colorectal cancer can additionally provide high-resolution, high-sensitivity datasets which can 
be used in the areas of diagnostics, therapeutics, drug development, pharmacogenetics, 
protein structure, biosensor development, and other related areas. For example, the 
expression profiles can be used in diagnostic or prognostic evaluation of patients with 
metastatic colorectal cancer. Or as another example, subcellular toxicological information 
can be generated to better direct drug structure and activity correlation (see Anderson, 
Pharmaceutical Proteomics: Targets, Mechanism, and Function, paper presented at the IBC 
Proteomics conference, Coronado, CA (June 1 1-12, 1998)). Subcellular toxicological 
information can also be utilized in a biological sensor device to predict the likely 
toxicological effect of chemical exposures and likely tolerable exposure thresholds (see U.S. 
Patent No. 5,8 1 1 ,23 1 ). Similar advantages accrue from datasets relevant to other 
biomolecules and bioactive agents (e.g., nucleic acids, saccharides, lipids, drugs, and the 
like). 

Thus, in another embodiment, the present invention provides a database that 
includes at least one set of assay data. The data contained in the database is acquired, e.g., 
using array analysis either singly or in a library format The database can be in substantially 
any form in which data can be maintained and transmitted, but is preferably an electronic 
database. The electronic database of the invention can be maintained on any electronic 
device allowing for the storage of and access to the database, such as a personal computer, 
but is preferably distributed on a wide area network, such as the World Wide Web. 

The focus of the present section on databases that include peptide sequence 
data is for clarity of illustration only. It will be apparent to those of skill in the art that similar 
databases can be assembled for assay data acquired using an assay of the invention. 
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The compositions and methods for identifying and/or quantitating the relative 
and/or absolute abundance of a variety of molecular and macromolecular species from a 
biological sample undergoing metastatic colorectal cancer, i.e., the identification of 
metastatic colorectal cancer-associated sequences described herein, provide an abundance of 
information, which can be correlated with pathological conditions, predisposition to disease, 
drug testing, therapeutic monitoring, gene-disease causal linkages, identification of correlates 
of immunity and physiological status, among others. Although the data generated Scorn the 
assays of the invention is suited for manual review and analysis, in a preferred embodiment, 
prior data processing using high-speed computers is utilized. 

An array of methods for indexing and retrieving biomolecular information is 
known in the art. For example, U.S. Patents 6,023,659 and 5,966,712 disclose a relational 
database system for storing biomolecular sequence information in a manner that allows 
sequences to be catalogued and searched according to one or more protein function 
hierarchies. U.S. Patent 5,953,727 discloses a relational database having sequence records 
containing information in a format that allows a collection of partial-length DNA sequences 
to be catalogued and searched according to association with one or more sequencing projects 
for obtaining full-length sequences from the collection of partial length sequences. U.S. 
Patent 5,706,498 discloses a gene database retrieval system for making a retrieval of a gene 
sequence similar to a sequence data item in a gene database based on the degree of similarity 
between a key sequence and a target sequence. U.S. Patent 5,538,897 discloses a method 
using mass spectroscopy fragmentation patterns of peptides to identify amino acid sequences 
in computer databases by comparison of predicted mass spectra with experimentally-derived 
mass spectra using a closeness-of-fit measure. U.S. Patent 5,926,818 discloses a multi- 
dimensional database comprising a functionality for multi-dimensional data analysis 
described as on-line analytical processing (OLAP), which entails the consolidation of 
projected and actual data according to more than one consolidation path or dimension. U.S. 
Patent 5,295,261 reports a hybrid database structure in which the fields of each database 
record are divided into two classes, navigational and informational data, with navigational 
fields stored in a hierarchical topological map which can be viewed as a tree structure or as 
the merger of two or more such tree structures. 

See also Mount et al, Bioinformatics (2001); Biological Sequence Analysis: 
Probabilistic Models of Proteins and Nucleic Acids (Durbin et al, eds., 1999); 
Bioinformatics: A Practical Guide to the Analysis of Genes and Proteins (Baxevanis & 
Oeullette eds., 1998)); Rashidi & Buehler, Bioinformatics: Basic Applications in Biological 
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Science and Medicine (1999); Introduction to Computational Molecular Biology (Setubal et 
al 9 eds 1997); Bioinformatics: Methods and Protocols (Misener & Krawetz, eds, 2000); 
Bioinformatics: Sequence, Structure, and Databanks: A Practical Approach (Higgins & 
Taylor, eds., 2000); Brown, Bioinformatics: A Biologist 's Guide to Biocomputing and the 
Internet (2001); Han & Kamber, Data Mining: Concepts and Techniques (2000); and 
Waterman, Introduction to Computational Biology: Maps, Sequences, and Genomes (1995). 

The present invention provides a computer database comprising a computer 
and software for storing in computer-retrievable form assay data records cross-tabulated, e.g., 
with data specifying the source of the target-containing sample from which each sequence 
specificity record was obtained. 

In an exemplary embodiment, at least one of the sources of target-containing 
sample is from a control tissue sample known to be free of pathological disorders. In a 
variation, at least one of the sources is a known pathological tissue specimen, e.g., a 
neoplastic lesion or another tissue specimen to be analyzed for metastatic colorectal cancer. 
In another variation, the assay records cross-tabulate one or more of the following parameters 
for each target species in a sample: (1) a unique identification code, which can include, e.g., a 
target molecular structure and/or characteristic separation coordinate (e.g., electrophoretic 
coordinates); (2) sample source; and (3) absolute and/or relative quantity of the target species 
present in the sample. 

The invention also provides for the storage and retrieval of a collection of 
target data in a computer data storage apparatus, which can include magnetic disks, optical 
disks, magneto-optical disks, DRAM, SRAM, SGRAM, SDRAM, RDRAM, DDR RAM, 
magnetic bubble memory devices, and other data storage devices, including CPU registers 
and on-CPU data storage arrays. Typically, the target data records are stored as a bit pattern 
in an array of magnetic domains on a magnetizable medium or as an array of charge states or 
transistor gate states, such as an array of cells in a DRAM device (e.g., each cell comprised of 
a transistor and a charge storage area, which may be on the transistor). In one embodiment, 
the invention provides such storage devices, and computer systems built therewith, 
comprising a bit pattern encoding a protein expression fingerprint record comprising unique 
identifiers for at least 10 target data records cross-tabulated with target source. 

When the target is a peptide or nucleic acid, the invention preferably provides 
a method for identifying related peptide or nucleic acid sequences, comprising performing a 
computerized comparison between a peptide or nucleic acid sequence assay record stored in 
or retrieved from a computer storage device or database and at least one other sequence. The 
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comparison can include a sequence analysis or comparison algorithm or computer program 
embodiment thereof (e.g., FASTA, TFASTA, GAP, BESTFET) and/or the comparison may 
be of die relative amount of a peptide or nucleic acid sequence in a pool of sequences 
determined from a polypeptide or nucleic acid sample of a specimen. 

The invention also preferably provides a magnetic disk, such as an IBM- 
compatible (DOS, Windows, Windows95/98/2000, Windows NT, OS/2) or other format 
(e.g., Linux, SunOS, Solaris, AK, SCO Unix, VMS, MV, Macintosh, etc.) floppy diskette or 
hard (fixed, Winchester) disk drive, comprising a bit pattern encoding data from an assay of 
the invention in a file format suitable for retrieval and processing in a computerized sequence 
analysis, comparison, or relative quantitation method. 

The invention also provides a network, comprising a plurality of computing 
devices linked via a data link, such as an Ethernet cable (coax or lOBaseT), telephone line, 
ISDN line, wireless network, optical fiber, or other suitable signal transmission medium, 
whereby at least one network device (e.g., computer, disk array, etc.) comprises a pattern of 
magnetic domains (e.g., magnetic disk) and/or charge domains (e.g., an array of DRAM 
cells) composing a bit pattern encoding data acquired from an assay of the invention. 

The invention also provides a method for transmitting assay data that includes 
generating an electronic signal on an electronic communications device, such as a modem, 
ISDN terminal adapter, DSL, cable modem, ATM switch, or the like, wherein the signal 
includes (in native or encrypted format) a bit pattern encoding data from an assay or a 
database comprising a plurality of assay results obtained by the method of the invention. 

In a preferred embodiment, the invention provides a computer system for 
comparing a query target to a database containing an array of data structures, such as an assay 
result obtained by the method of the invention, and ranking database targets based on the 
degree of identity and gap weight to the target data. A central processor is preferably 
initialized to load and execute the computer program for alignment and/or comparison of the 
assay results. Data for a query target is entered into the central processor via an I/O device. 
Execution of the computer program results in the central processor retrieving the assay data 
from the data file, which comprises a binary description of an assay result 

The target data or record and the computer program can be transferred to 
secondary memory, which is typically random access memory (e.g., DRAM, SRAM, 
SGRAM, or SDRAM). Targets are ranked according to the degree of correspondence 
between a selected assay characteristic (e.g., binding to a selected affinity moiety) and the 
same characteristic of the query target and results are output via an I/O device. For example, 
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a central processor can be a conventional computer (e.g., Intel Pentium, PowerPC, Alpha, 
PA-8000, SPARC, MIPS 4400, MPS 10000, VAX, etc.); a program can be a commercial or 
public domain molecular biology software package (e.g., UWGCG Sequence Analysis 
Software, Darwin); a data file can be an optical or magnetic disk, a data server, a memory 
device (e.g., DRAM, SRAM, SGRAM, SDRAM, EPROM, bubble memory, flash memory, 
etc.); an I/O device can be a terminal comprising a video display and a keyboard, a modem, 
an ISDN terminal adapter, an Ethernet port, a punched card reader, a magnetic strip reader, or 
other suitable I/O device. 

The invention also preferably provides the use of a computer system, such as 
that described above, which comprises: (1) a computer; (2) a stored bit pattern encoding a 
collection of peptide sequence specificity records obtained by the methods of the invention, 
which may be stored in the computer, (3) a comparison target, such as a query target; and (4) 
a program for alignment and comparison, typically with rank-ordering of comparison results 
on the basis of computed similarity values. 

Characteristics of metastatic colorectal cancer-associated proteins 

Metastatic colorectal cancer proteins of the present invention may be classified 
as secreted proteins, transmembrane proteins or intracellular proteins. In one embodiment, 
the metastatic colorectal cancer protein is an intracellular protein. Intracellular proteins may 
be found in the cytoplasm and/or in the nucleus and/or in the organelles. Proteins containing 
one or more transmembrane domains that exclusively reside in organelles are also considered 
intracellular proteins. Intracellular proteins are involved in all aspects of cellular function 
and replication (including, e.g., signaling pathways); aberrant expression of such proteins 
often results in unregulated or disregulated cellular processes {see, e.g. t Molecular Biology of 
the Cell (Alberts, ed., 3rd e<L, 1994). For example, many intracellular proteins have 
enzymatic activity such as protein kinase activity, protein phosphatase activity, protease 
activity, nucleotide cyclase activity, polymerase activity and the like. Intracellular proteins 
also serve as docking proteins that are involved in organizing complexes of proteins, or 
targeting proteins to various subcellular localizations, and are involved in maintaining the 
structural integrity of organelles. 

An increasingly appreciated concept in characterizing proteins is the presence 
in the proteins of one or more motifs for which defined functions have been attributed. In 
addition to the highly conserved sequences found in the enzymatic domain of proteins, highly 
conserved sequences have been identified in proteins that are involved in protein-protein 
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interaction. For example, Src-homology-2 (SH2) domains bind tyrosine-phosphorylated 
targets in a sequence dependent manner. PTB domains, which are distinct from SH2 
domains, also bind tyrosine phosphorylated targets. SH3 domains bind to proline-rich 
targets. In addition, PH domains, tetratricopeptide repeats and WD domains to name only a 
few, have been shown to mediate protein-protein interactions. Some of these may also be 
involved in binding to phospholipids or other second messengers. As will be appreciated by 
one of ordinary skill in the art, these motifs can be identified on the basis of primary 
sequence; thus, an analysis of the sequence of proteins may provide insight into both the 
enzymatic potential of the molecule and/or molecules with which the protein may associate. 
One useful database is Pfam (protein families), which is a large collection of multiple 
sequence alignments and hidden Markov models covering many common protein domains. 
Versions are available via the internet from Washington University in St. Louis, the Sanger 
Center in England, and the Karolinska Institute in Sweden (see, e.g. 9 Bateman et al, Nuc. 
Acids Res. 28:263-266 (2000); Sonnhammer et al. t Proteins 28:405-420 (1997); Bateman et 
al, Nuc. Acids Res. 27:260-262 (1999); and Sonnhammer et al, Nuc. Acids Res. 26:320-322- 
(1998)). 

In another embodiment, the metastatic colorectal cancer sequences are 
transmembrane proteins. Transmembrane proteins are molecules that span a phospholipid 
bilayer of a cell. They may have an intracellular domain, an extracellular domain, or both. 
The intracellular domains of such proteins may have a number of functions including those 
already described for intracellular proteins. For example, the intracellular domain may have 
enzymatic activity and/or may serve as a binding site for additional proteins. Frequently the 
intracellular domain of transmembrane proteins serves both roles. For example certain 
receptor tyrosine kinases have both protein kinase activity and SH2 domains. In addition, 
autophosphorylation of tyrosines on the receptor molecule itself, creates binding sites for 
additional SH2 domain containing proteins. 

Transmembrane proteins may contain from one to many transmembrane 
domains. For example, receptor tyrosine kinases, certain cytokine receptors, receptor 
guanylyl cyclases and receptor serine/threonine protein kinases contain a single 
transmembrane domain. However, various other proteins including channels, pumps, and 
adenylyl cyclases contain numerous transmembrane domains. Many important cell surface 
receptors such as G protein coupled receptors (GPCRs) are classified as "seven 
transmembrane domain" proteins, as they contain 7 membrane spanning regions. 
Characteristics of transmembrane domains include approximately 20 consecutive 



31 



WO 02/068677 PCT/US02/06001 

hydrophobic amino acids that may be followed by charged amino acids. Therefore, upon 
analysis of the amino acid sequence of a particular protein, the localization and number of 
transmembrane domains within the protein may be predicted (see, e.g. PSORT web site 
http://psort.nibb.ac.jp/). 

The extracellular domains of transmembrane proteins are diverse; however, 
conserved motifs are found repeatedly among various extracellular domains. Conserved 
structure and/or functions have been ascribed to different extracellular motifs. Many 
extracellular domains are involved in binding to other molecules. In one aspect, extracellular 
domains are found on receptors. Factors that bind the receptor domain include circulating 
ligands, which may be peptides, proteins, or small molecules such as adenosine and the like. 
For example, growth factors such as EGF, FGF and PDGF are circulating growth factors that 
bind to their cognate receptors to initiate a variety of cellular responses. Other factors include 
cytokines, mitogenic factors, hormones, neurotrophic factors and the like. Extracellular 
domains also bind to cell-associated molecules. In this respect, they mediate cell-cell 
interactions. Cell-associated ligands can be tethered to the cell, e.g., via a 
glycosylphosphatidylinositol (GPI) anchor, or may themselves be transmembrane proteins. 
Extracellular domains also associate with the extracellular matrix and contribute to the 
maintenance of the cell structure. 

Metastatic colorectal cancer proteins that are transmembrane are particularly 
preferred in the present invention as they are readily accessible targets for extracellular 
immunotherapeutics, as are described herein. In addition, as outlined below, transmembrane 
proteins can be also useful in imaging modalities. Antibodies may be used to label such 
readily accessible proteins in situ or in histological analysis. Alternatively, antibodies can 
also label intracellular proteins, in which case analytical samples are typically penneablized 
to provide access to intracellular proteins. 

It will also be appreciated by those in the art that a transmembrane protein can 
be made soluble by removing transmembrane sequences, e.g., through recombinant methods. 
Furthermore, transmembrane proteins that have been made soluble can be made to be 
secreted through recombinant means by adding an appropriate signal sequence. 

In another embodiment, the metastatic colorectal cancer proteins are secreted 
proteins; the secretion of which can be either constitutive or regulated. These proteins have a 
signal peptide or signal sequence that targets the molecule to the secretory pathway. Secreted 
proteins are involved in numerous physiological events; by virtue of their circulating nature, 
they often serve to transmit signals to various other cell types. The secreted protein may 
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function in an autocrine manner (acting on. the cell that secreted the factor), a paracrine 
manner (acting on cells in close proximity to the cell that secreted the factor) or an endocrine 
manner (acting on cells at a distance). Thus secreted molecules find use in modulating or 
altering numerous aspects of physiology. Metastatic colorectal cancer proteins that are 
secreted proteins are particularly preferred in the present invention as they serve as good 
targets for diagnostic markers, e.g., for blood, plasma, serum, or stool tests. 

Use of metastatic colorectal cancer nucleic acids 

As described above, metastatic colorectal cancer sequence is initially 
identified by substantial nucleic acid and/or amino acid sequence homology or linkage to the 
metastatic colorectal cancer sequences outlined herein. Such homology can be based upon 
the overall nucleic acid or amino acid sequence, and is generally determined as outlined 
below, using either homology programs or hybridization conditions. Typically, linked 
sequences on a mRNA are found on the same molecule. 

The metastatic colorectal cancer nucleic acid sequences of the invention, e.g., 
the sequences in Tables 1-26, can be fragments of larger genes, i.e., they are nucleic acid 
segments. "Genes" in this context includes coding regions, non-coding regions, and mixtures 
of coding and non-coding regions. Accordingly, as will be appreciated by those in the art, 
using the sequences provided herein, extended sequences, in either direction, of the 
metastatic colorectal cancer genes can be obtained, using techniques well known in the art for 
cloning either longer sequences or the full length sequences; see Ausubel, et aL, supra. 
Much can be done by informatics and many sequences can be clustered to include multiple 
sequences corresponding to a single gene, e.g., systems such as UniGene (see, 
http://www.ncbi.nlm.nih.gov/unigene/). 

Once the metastatic colorectal cancer nucleic acid is identified, it can be 
cloned and, if necessary, its constituent parts recombined to form the entire metastatic 
colorectal cancer nucleic acid coding regions or the entire mRNA sequence. Once isolated 
from its natural source, e.g., contained within a plasmid or other vector or excised therefrom 
as a linear nucleic acid segment, the recombinant metastatic colorectal cancer nucleic acid 
can be further-used as a probe to identify and isolate other metastatic colorectal cancer 
nucleic acids, e.g., extended coding regions. It can also be used as a "precursor" nucleic acid 
to make modified or variant metastatic colorectal cancer nucleic acids and proteins. 

The metastatic colorectal cancer nucleic acids of the present invention are 
used in several ways. In a first embodiment, nucleic acid probes to the metastatic colorectal 
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cancer nucleic acids are made and attached to biochips to be used in screening and diagnostic 
methods, as outlined below, or for administration, e.g., for gene therapy, vaccine, and/or 
antisense applications. Alternatively, the metastatic colorectal cancer nucleic acids that 
include coding regions of metastatic colorectal cancer proteins can be put into expression 
vectors for the expression of metastatic colorectal cancer proteins, again for screening 
purposes or for administration to a patient 

In a preferred embodiment, nucleic acid probes to metastatic colorectal cancer 
nucleic acids (both the nucleic acid sequences outlined in the figures and/or the complements 
thereof) are made. The nucleic acid probes attached to the biochip are designed to be 
substantially complementary to the metastatic colorectal cancer nucleic acids, i.e. the target 
sequence (either the target sequence of the sample or to other probe sequences, e.g., in 
sandwich assays), such that hybridization of the target sequence and the probes of the present 
invention occurs. As outlined below, this complementarity need not be perfect; there may be 
any number of base pair mismatches which will interfere with hybridization between the 
target sequence and the single stranded nucleic acids of the present invention. However, if 
the number of mutations is so great that no hybridization can occur under even the least 
stringent of hybridization conditions, the sequence is not a complementary target sequence. 
Thus, by "substantially complementary" herein is meant that the probes are sufficiently 
complementary to the target sequences to hybridize under appropriate reaction conditions, 
particularly high stringency conditions, as outlined herein. 

A nucleic acid probe is generally single stranded but can be partially single 
and partially double stranded. The strandedness of the probe is dictated by the structure, 
composition, and properties of the target sequence. In general, the nucleic acid probes range 
from about 8 to about 100 bases long, with from about 10 to about 80 bases being preferred, 
and from about 30 to about 50 bases being particularly preferred. That is, generally 
complements of ORFs or whole genes are not used. In some embodiments, nucleic acids of 
lengths up to hundreds of bases can be used. 

In a preferred embodiment, more than one probe per sequence is used, with 
either overlapping probes or probes to different sections of the target being used. That is, 
two, three, four or more probes, with three being preferred, are used to build in a redundancy 
for a particular target. The probes can be overlapping (i.e., have some sequence in common), 
or separate. In some cases, PCR primers may be used to amplify signal for higher sensitivity. 

As will be appreciated by those in the art, nucleic acids can be attached or 
immobilized to a solid support in a wide variety of ways. By "immobilized" and grammatical 
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equivalents herein is meant the association or binding between the nucleic acid probe and the 
solid support is sufficient to be stable under the conditions of binding, washing, analysis, and 
removal as outlined below. The binding can typically be covalent or non-covalent By "non- 
covalent binding" and grammatical equivalents herein is typically meant one or more of 
electrostatic, hydrophilic, and hydrophobic interactions. Included in non-covalent binding is 
the covalent attachment of a molecule, such as, streptavidin to the support and the non- 
covalent binding of the biotinylated probe to the streptavidin. By "covalent binding" and 
grammatical equivalents herein is meant that the two moieties, the solid support and the 
probe, are attached by at least one bond, including sigma bonds, pi bonds and coordination 
bonds. Covalent bonds can be formed directly between the probe and the solid support or can 
be formed by a cross linker or by inclusion of a specific reactive group on either the solid 
support or the probe or both molecules. Immobilization may also involve a combination of 
covalent and non-covalent interactions. 

In general, the probes are attached to a biochip in a wide variety of ways, as 
will be appreciated by those in the art. As described herein, the nucleic acids can either be 
synthesized first, with subsequent attachment to the biochip, or can be directly synthesized on 
the biochip. 

The biochip comprises a suitable solid substrate. By "substrate" or "solid 
support" or other grammatical equivalents herein is meant a material that can be modified to 
contain discrete individual sites appropriate for the attachment or association of the nucleic 
acid probes and is amenable to at least one detection method As will be appreciated by those 
in the art, the number of possible substrates are very large, and include, but are not limited to, 
glass and modified or functionalized glass, plastics (including acrylics, polystyrene and 
copolymers of styrene and other materials, polypropylene, polyethylene, polybutylene, 
polyurethanes, Teflon, etc.), polysaccharides, nylon or nitrocellulose, resins, silica or silica- 
based materials including silicon and modified silicon, carbon, metals, inorganic glasses, 
plastics, etc. In general, the substrates allow optical detection and do not appreciably 
fluoresce. A preferred substrate is described in copending application entitled Reusable Low 
Fluorescent Plastic Biochip, U.S. Application Serial No. 09/270,214, filed March 15, 1999, 
herein incorporated by reference in its entirety. 

Generally the substrate is planar, although as will be appreciated by those in 
the art, other configurations of substrates may be used as well. For example, the probes may 
be placed on the inside surface of a tube, for flow-through sample analysis to minimize 
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sample volume. Similarly, the substrate may be flexible, such as a flexible foam, including 
closed cell foams made of particular plastics. 

In a preferred embodiment, the surface of the biochip and the probe may be 
derivatized with chemical functional groups for subsequent attachment of the two. Thus, e.g., 
the biochip is derivatized with a chemical functional group including, but not limited to, 
amino groups, carboxy groups, oxo groups and thiol groups, with amino groups being 
. particularly preferred. Using these functional groups, the probes can be attached using 
functional groups on the probes. For example, nucleic acids containing amino groups can be 
attached to surfaces comprising amino groups, e.g., using linkers as are known in the art; e.g., 
homo-or hetero-bifunctional linkers as are well known {see 1994 Pierce Chemical Company 
catalog, technical section on cross-linkers, pages 155-200). In addition, in some cases, 
additional linkers, such as alkyl groups (including substituted and heteroalkyl groups) may be 
used. 

In this embodiment, oligonucleotides are synthesized as is known in the art, 
and then attached to the surface of the solid support. As will be appreciated by those skilled 
in the art, either the 5' or 3* terminus may be attached to the solid support, or attachment may 
be via an internal nucleoside. 

In another embodiment, the immobilization to the solid support may be very 
strong, yet non-covalent. For example, biotinylated oligonucleotides can be made, which 
bind to surfaces covalently coated with streptavidin, resulting in attachment. 

Alternatively, the oligonucleotides may be synthesized on the surface, as is 
known in the art. For example, photoactivation techniques utilizing photopolymerization 
compounds and techniques are used In a preferred embodiment, the nucleic acids can be 
synthesized in situ, using well known photolithographic techniques, such as those described 
in WO 95/251 16; WO 95/35505; U.S. Patent Nos. 5,700,637 and 5,445,934; and references 
cited within, all of which are expressly incorporated by reference; these methods of 
attachment form the basis of the Affimetrix GeneChip™ technology. 

Often, amplification-based assays are performed to measure the expression 
level of metastatic colorectal cancer-associated sequences. These assays are typically 
performed in conjunction with reverse transcription. In such assays, a metastatic colorectal 
cancer-associated nucleic acid sequence acts as a template in an amplification reaction (e.g., 
Polymerase Chain Reaction, or PCR). In a quantitative amplification, the amount of 
amplification product will be proportional to the amount of template in the original sample. 
Comparison to appropriate controls provides a measure of the amount of metastatic colorectal 
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cancer-associated RNA Methods of quantitative amplification are well known to those of 
skill in the art. Detailed protocols for quantitative PCR are provided, e.g., in Innis et al, PCR 
Protocols, A Guide to Methods and Applications (1990). 

In some embodiments, a TaqMan based assay is used to measure expression. 
TaqMan based assays use a fluorogenic oligonucleotide probe that contains a 5' fluorescent 
dye and a 3' quenching agent. The probe hybridizes to a PCR product, but cannot itself be 
extended due to a blocking agent at the 3' end. When the PCR product is amplified in 
subsequent cycles, the 5' nuclease activity of the polymerase, e.g., AmpliTaq, results in the 
cleavage of the TaqMan probe. This cleavage separates the 5' fluorescent dye and the 3' 
quenching agent, thereby resulting in an increase in fluorescence as a function of 
amplification (see, e.g., literature provided by Parkin-Elmer, e.g., www2.perkin-elmer.com). 

Other suitable amplification methods include, but are not limited to, ligase 
chain reaction (LCR) (see Wu & Wallace, Genomics 4:560 (1989), Landegren et al, Science 
241:1077 (1988), and Barringer et al., Gene 89:1 17 (1990)), transcription amplification 
(Kwoh et al, Proc. Natl Acad. Sci. USA 86:1173 (1989)), self-sustained sequence replication 
(Guatelli et al, Proa Nau Acad. Set USA 87:1874 (1990)), dot PCR, and linker adapter PCR, 
etc. 

Expression of metastatic colorectal cancer proteins from nucleic acids 

In a preferred embodiment, metastatic colorectal cancer nucleic acids, e.g., 
encoding metastatic colorectal cancer proteins, are used to make a variety of expression 
vectors to express metastatic colorectal cancer proteins which can then be used in screening 
assays, as described below. Expression vectors and recombinant DNA technology are well 
known to those of skill in the art (see, e.g., Ausubel, supra, and Gene Expression Systems 
(Fernandez & Hoeffler, eds, 1999)) and are used to express proteins. The expression vectors 
may be either self-replicating extrachromosomal vectors or vectors which integrate into a 
host genome. Generally, these expression vectors include transcriptional and translational 
regulatory nucleic acid operably linked to the nucleic acid encoding the metastatic colorectal 
cancer protein. The term "control sequences" refers to DNA sequences used for the 
expression of an operably linked coding sequence in a particular host organism. Control 
sequences that are suitable for prokaryotes, e.g., include a promoter, optionally an operator 
sequence, and a ribosome binding site. Eukaiyotic cells are known to utilize promoters, 
polyadenylation signals, and enhancers. 
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Nucleic acid is "operably linked" when it is placed into a functional 
relationship with another nucleic acid sequence. For example, DNA for a presequence or 
secretory leader is operably linked to DNA for a polypeptide if it is expressed as a preprotein 
that participates in the secretion of the polypeptide; a promoter or enhancer is operably linked 
to a coding sequence if it affects the transcription of the sequence; or a ribosome binding site 
is operably linked to a coding sequence if it is positioned so as to facilitate translation. 
Generally, "operably linked" means that the DNA sequences being linked are contiguous, 
and, in the case of a secretory leader, contiguous and in reading phase. However, enhancers 
do not have to be contiguous. Linking is typically accomplished by ligation at convenient 
restriction sites. If such sites do not exist, synthetic oligonucleotide adaptors or linkers are 
used in accordance with conventional practice. Transcriptional and translational regulatory 
nucleic acid will generally be appropriate to the host cell used to express the metastatic 
colorectal cancer protein. Numerous types of appropriate expression vectors, and suitable 
regulatory sequences are known in the art for a variety of host cells. 

In general, transcriptional and translational regulatory sequences may include, 
but are not limited to, promoter sequences, ribosomal binding sites, transcriptional start and 
stop sequences, translational start and stop sequences, and enhancer or activator sequences. 
In a preferred embodiment, the regulatory sequences include a promoter and transcriptional 
start and stop sequences. 

Promoter sequences encode either constitutive or inducible promoters. The 
promoters may be either naturally occurring promoters or hybrid promoters. Hybrid 
promoters, which combine elements of more than one promoter, are also known in the art, 
and are useful in the present invention. 

In addition, an expression vector may comprise additional elements. For 
example, the expression vector may have two replication systems, thus allowing it to be 
maintained in two organisms, e.g., in mammalian or insect cells for expression and in a 
procaryotic host for cloning and amplification. Furthermore, for integrating expression 
vectors, the expression vector contains at least one sequence homologous to the host cell 
genome, and preferably two homologous sequences which flank the expression construct. 
The integrating vector may be directed to a specific locus in the host cell by selecting the 
appropriate homologous sequence for inclusion in the vector. Constructs for integrating 
vectors are well known in the art (e.g., Fernandez & Hoeffler, supra). 
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In addition, in a preferred embodiment, the expression vector contains a 
selectable marker gene to allow the selection of transformed host cells. Selection genes are 
well known in the art and will vary with the host cell used. 

The metastatic colorectal cancer proteins of the present invention are produced 
by culturing a host cell transformed with an expression vector containing nucleic acid 
encoding a metastatic colorectal cancer protein, under the appropriate conditions to induce or 
cause expression of the metastatic colorectal cancer protein. Conditions appropriate for 
metastatic colorectal cancer protein expression will vary with the choice of the expression 
vector and the host cell, and will be easily ascertained by one skilled in the art through 
routine experimentation or optimization. For example, the use of constitutive promoters in 
the expression vector will require optimizing the growth and proliferation of the host cell, 
while the use of an inducible promoter requires the appropriate growth conditions for 
induction. In addition, in some embodiments, the timing of the harvest is import For 
example, the baculoviral systems used in insect cell expression are lytic viruses, and thus 
harvest time selection can be crucial for product yield. 

Appropriate host cells include yeast, bacteria, archaebacteria, fungi, and insect 
and animal cells, including mammalian cells. Of particular interest are Saccharomyces 
cerevisiae and other yeasts, E. coli, Bacillus subtilis, Sf9 cells, C129 cells, 293 cells, 
Neurospora, BHK, CHO, COS, HeLa cells, HUVEC (human umbilical vein endothelial 
cells), THP1 cells (a macrophage cell line) and various other human cells and cell lines. 

In a preferred embodiment, the metastatic colorectal cancer proteins are 
expressed in mammalian cells. Mammalian expression systems are also known in the art, and 
include retroviral and adenoviral systems. Of particular use as mammalian promoters are the 
promoters from mammalian viral genes, since the viral genes are often highly expressed and 
have a broad host range. Examples include the SV40 early promoter, mouse mammary tumor 
virus LTR promoter, adenovirus major late promoter, herpes simplex virus promoter, and the 
CMV promoter (see, e.g„ Fernandez & Hoeffler, supra). Typically, transcription termination 
and polyadenylation sequences recognized by mammalian cells are regulatory regions located 
3' to the translation stop codon and thus, together with the promoter elements, flank the 
coding sequence. Examples of transcription terminator and polyadenylation signals include 
those derived form SV40. 

The methods of introducing exogenous nucleic acid into mammalian hosts, as 
well as other hosts, is well known in the art, and will vary with the host cell used. 
Techniques include dextran-mediated transfection, calcium phosphate precipitation, 
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poiybrene mediated transfection, protoplast fusion, electroporation, viral infection, 
encapsulation of the polynucleotide(s) in liposomes, and direct microinjection of the DNA 
into nuclei. 

In a preferred embodiment, metastatic colorectal cancer proteins are expressed 
in bacterial systems. Promoters from bacteriophage may also be used and are known in the 
art. In addition, synthetic promoters and hybrid promoters are also useful; e.g., the tac 
promoter is a hybrid of the trp and lac promoter sequences. Furthermore, a bacterial 
promoter can include naturally occurring promoters of non-bacterial origin that have the 
ability to bind bacterial RNA polymerase and initiate transcription. In addition to a 
functioning promoter sequence, an efficient ribosome binding site is desirable. The 
expression vector may also include a signal peptide sequence that provides for secretion of 
the metastatic colorectal cancer protein in bacteria. The protein is either secreted into the 
growth media (gram-positive bacteria) or into the periplasmic space, located between the 
inner and outer membrane of the cell (gram-negative bacteria). The bacterial expression 
vector may also include a selectable marker gene to allow for the selection of bacterial strains 
that have been transformed. Suitable selection genes include genes which render the bacteria 
resistant to drugs such as ampicillin, chloramphenicol, erythromycin, kanamycin, neomycin 
and tetracycline. Selectable markers also include biosynthetic genes, such as those in the 
histidine, tryptophan and leucine biosynthetic pathways. These components are assembled 
into expression vectors. Expression vectors for bacteria are well known in the art, and 
include vectors for Bacillus subtilis, E. coli, Streptococcus cremoris, and Streptococcus 
lividans, among others (e.g., Fernandez & Hoeffier, supra). The bacterial expression vectors 
are transformed into bacterial host cells using techniques well known in the art, such as 
calcium chloride treatment, electroporation, and others. 

In one embodiment, metastatic colorectal cancer proteins are produced in 
insect cells. Expression vectors for the transformation of insect cells, and in particular, 
baculovirus-based expression vectors, are well known in the art. 

In a preferred embodiment, metastatic colorectal cancer protein is produced in 
yeast cells. Yeast expression systems are well known in the art, and include expression 
vectors for Saccharomyces cerevisiae, Candida albicans and C. maltosa, Hansenula 
polymorpha, Kluyveromyces fragilis and AT. lactis, Pichia guillerimondii and P. pastoris, 
Schizosaccharomyces pombe, and Yarroma lipolytica. 

The metastatic colorectal cancer protein may also be made as a fusion protein, 
using techniques well known in the art. Thus, e.g., for the creation of monoclonal antibodies, 
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if the desired epitope is small, the metastatic colorectal cancer protein may be fused to a 
carrier protein to form an immunogen. Alternatively, the metastatic colorectal cancer protein 
may be made as a fusion protein to increase expression for affinity purification purposes, or 
for other reasons. For example, when the metastatic colorectal cancer protein is a metastatic 
colorectal cancer peptide, the nucleic acid encoding the peptide may be linked to other 
nucleic acid for expression purposes. 

In a preferred embodiment, the metastatic colorectal cancer protein is purified 
or isolated after expression. Metastatic colorectal cancer proteins may be isolated or purified 
in a variety of appropriate ways. Standard purification methods include electrophoretic, 
molecular, immunological and chromatographic techniques, including ion exchange, 
hydrophobic, affinity, and reverse-phase HPLC chromatography, and chromatofocusing. For 
example, the metastatic colorectal cancer protein may be purified using a standard anti- 
metastatic colorectal cancer protein antibody column. Ultrafiltration and diafiltration 
techniques, in conjunction with protein concentration, are also useful. For general guidance 
in suitable purification techniques, see Scopes, Protein Purification (1982). The degree of 
purification necessary will vary depending on the use of the metastatic colorectal cancer 
protein. In some instances no purification will be necessary. 

Once expressed and purified if necessary, the metastatic colorectal cancer 
proteins and nucleic acids are useful in a number of applications. They may be used as 
immunoselection reagents, as vaccine reagents, as screening agents, etc. 

Variants of metastatic colorectal cancer proteins 

In one embodiment, the metastatic colorectal cancer proteins are derivative or 
variant metastatic colorectal cancer proteins as compared to the wild-type sequence. That is, 
as outlined more fully below, the derivative metastatic colorectal cancer peptide will often 
contain at least one amino acid substitution, deletion or insertion, with amino acid 
substitutions being particularly preferred. The amino acid substitution, insertion or deletion 
may occur at a particular residue within the metastatic colorectal cancer peptide. 

Also included within one embodiment of metastatic colorectal cancer proteins 
of the present invention are amino acid sequence variants. These variants typically fall into 
one or more of three classes: substitutional, insertional or deletional variants. These variants 
ordinarily are prepared by site specific mutagenesis of nucleotides in the DNA encoding the 
metastatic colorectal cancer protein, using cassette or PCR mutagenesis or other techniques, 
to produce DNA encoding the variant, and thereafter expressing the DNA in recombinant cell 
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culture as outlined above. However, variant metastatic colorectal cancer protein fragments 
having up to about 100-150 residues may be prepared by in vitro synthesis. Amino acid 
sequence variants are characterized by the predetermined nature of the variation, a feature 
that sets them apart from naturally occurring allelic or interspecies variation of the metastatic 
colorectal cancer protein amino acid sequence. The variants typically exhibit the same 
qualitative biological activity as the naturally occurring analogue, although variants can also 
be selected which have modified characteristics as will be more fully outlined below. 

While the site or region for introducing an amino acid sequence variation is 
often predetermined, the mutation per se need not be predetermined. For example, in order to 
optimize the performance of a mutation at a given site, random mutagenesis may be 
conducted at the target codon or region and the expressed metastatic colorectal cancer 
variants screened for the optimal combination of desired activity. Techniques exist for 
making substitution mutations at predetermined sites in DNA having a known sequence, e.g., 
Ml 3 primer mutagenesis and PCR mutagenesis. Screening of the mutants is done using 
assays of metastatic colorectal cancer protein activities. 

Amino acid substitutions are typically of single residues; insertions usually 
will be on the order of from about 1 to 20 amino acids, although considerably larger 
insertions may be occasionally tolerated Deletions range from about 1 to about 20 residues, 
although in some cases deletions may be much larger. 

Substitutions, deletions, insertions or any combination thereof may be used to 
arrive at a final derivative. Generally these changes are done on a few amino acids to 
minimize the alteration of the molecule. Larger changes may be tolerated in certain 
circumstances. When small alterations in the characteristics of a metastatic colorectal cancer 
protein are desired, substitutions are generally made in accordance with the amino acid 
substitution chart provided in the definition sectioa 

Variants typically exhibit the same qualitative biological activity and will 
elicit the same immune response as the naturally-occurring analog, although variants also are 
selected to modify the characteristics of the metastatic colorectal cancer proteins as needed. 
Alternatively, the variant may be designed or reorganized such that the biological activity of 
the metastatic colorectal cancer protein is altered. For example, glycosylation sites may be 
altered or removed. 

Covalent modifications of metastatic colorectal cancer polypeptides are 
included within the scope of this invention. One type of covalent modification includes 
reacting targeted amino acid residues of a metastatic colorectal cancer polypeptide with an 
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organic derivatizing agent that is capable of reacting with selected side chains or the N-or C- 
tenninal residues of a metastatic colorectal cancer polypeptide. Derivatization with 
bifimctional agents is useful, for instance, for crosslinking metastatic colorectal cancer 
polypeptides to a water-insoluble support matrix or surface for use in the method for 
purifying anti-metastatic colorectal cancer polypeptide antibodies or screening assays, as is 
more fully described below. Commonly used crosslinking agents include, e.g., 1,1- 
" bis(diazoacetyl)-2-phenylethane, glutaraldehyde, N-hydroxysuccinimide esters, e.g., esters 
with 4-azidosalicylic acid, homobifunctional imidoesters, including disuccinimidyl esters 
such as 3,3 , -dithiobis(succinimidylpropionate), bifimctional maleimides such as bis-N- 
maleimido-l,8-octane and agents such as methyl-3-((p-azidophenyl)dithio)propioiniidate. 

Other modifications include deamidation of glutaminyl and asparaginyl 
residues to the corresponding glutamyl and aspartyl residues, respectively, hydroxylation of 
proline and lysine, phosphorylation of hydroxyl groups of seryl, threonyl or tyrosyl residues, 
methylation of the y-amino groups of lysine, arginine, and histidine side chains (Creighton, 
Proteins: Structure and Molecular Properties, pp. 79-86 (1983)), acetylation of the N- 
tenninal amine, and amidation of any C -terminal carboxyl group. 

Another type of covalent modification of the metastatic colorectal cancer 
polypeptide encompassed by this invention is an altered native glycosylation pattern of the 
polypeptide. "Altering the native glycosylation pattern" is intended herein to mean adding to 
or deleting one or more carbohydrate moieties of a native sequence metastatic colorectal 
cancer polypeptide. Glycosylation patterns can be altered in many ways. For example the 
use of different cell types to express metastatic colorectal cancer-associated sequences can 
result in different glycosylation patterns. 

Addition of glycosylation sites to metastatic colorectal cancer polypeptides 
may also be accomplished by altering the amino acid sequence thereof. The alteration may 
be made, e.g., by the addition of, or substitution by, one or more serine or threonine residues 
to the native sequence metastatic colorectal cancer polypeptide (for O-linked glycosylation 
sites). The metastatic colorectal cancer amino acid sequence may optionally be altered 
through changes at the DNA level, particularly by mutating the DNA encoding the metastatic 
colorectal cancer polypeptide at preselected bases such that codons are generated that will 
translate into the desired amino acids. 

Another means of increasing the number of carbohydrate moieties on the 
metastatic colorectal cancer polypeptide is by chemical or enzymatic coupling of glycosides 
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to the polypeptide. Such methods are described in the art, e.g., in WO 87/05330, and in 
Aplin & Wriston, CRC Crit Rev. Biochem., pp. 259-306 (1981). 

Removal of carbohydrate moieties present on the metastatic colorectal cancer 
polypeptide may be accomplished chemically or enzymatically or by mutational substitution 
of codons encoding for amino acid residues that serve as targets for glycosylation. Chemical 
deglycosylation techniques are known in the art and described, for instance, by Hakimuddin, 
et al,Arch. Biochem. Biophys., 259:52 (1987) and by Edge et al, Anal. Biochem., 118:131 
(1981). En2ymatic cleavage of carbohydrate moieties on polypeptides can be achieved by the 
use of a variety of endo-and exo-glycosidases as described by Thotakura et al, Meth. 
EnzymoL, 138:350 (1987). 

Another type of covalent modification of metastatic colorectal cancer 
comprises linking the metastatic colorectal cancer polypeptide to one of a variety of 
nonproteinaceous polymers, e.g., polyethylene glycol, polypropylene glycol, or 
polyoxyalkylenes, in the manner set forth in U.S. Patent Nos. 4,640,835; 4,496,689; 
4,301,144; 4,670,417; 4,791,192 or 4,179,337. 

Metastatic colorectal cancer polypeptides of the present invention may also be 
modified in a way to form chimeric molecules comprising a metastatic colorectal cancer 
polypeptide fused to another, heterologous polypeptide or amino acid sequence. In one 
embodiment, such a chimeric molecule comprises a fusion of a metastatic colorectal cancer 
polypeptide with a tag polypeptide which provides an epitope to which an anti-tag antibody 
can selectively bind The epitope tag is generally placed at the amino-or carboxyl-terminus 
of the metastatic colorectal cancer polypeptide. The presence of such epitope-tagged forms 
of a metastatic colorectal cancer polypeptide can be detected using an antibody against the 
tag polypeptide. Also, provision of the epitope tag enables the metastatic colorectal cancer 
polypeptide to be readily purified by affinity purification using an anti-tag antibody or 
another type of affinity matrix that binds to the epitope tag. In an alternative embodiment, 
the chimeric molecule may comprise a fusion of a metastatic colorectal cancer polypeptide 
with an immunoglobulin or a particular region of an immunoglobulin. For a bivalent form of 
the chimeric molecule, such a fusion could be to the Fc region of an IgG molecule. 

Various tag polypeptides and their respective antibodies are well known and 
examples include poly-histidine (poly-his) or poly-histidine-glycine (poly-his-gly) tags; HIS6 
and metal chelation tags, the flu HA tag polypeptide and its antibody 12CA5 (Field et al, 
Mol Cell. Biol. 8:2159-2165 (1988)); the c-myc tag and the 8F9, 3C7, 6E10, G4, B7 and 
9E10 antibodies thereto (Evan et al, Molecular and Cellular Biology 5:3610-3616 (1985)); 
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and the Herpes Simplex virus glycoprotein D (gD) tag and its antibody (Paborsky et al % 
Protein Engineering 3(6):547-553 (1990)). Other tag polypeptides include the Flag-peptide 
(Hopp et al t BioTechnology 6:1204-1210 (1988)); the KT3 epitope peptide (Martin et at, 
Science 255:192-194 (1992)); tubulin epitope peptide (Skinner et al> /. Biol Chem. 
266:15163-15166 (1991)); and the T7 gene 10 protein peptide tag (Lutz-Freyermuth et at, 
Proc. Natl Acad, ScL USA 87:6393-6397 (1990)). 

Also included are other metastatic colorectal cancer proteins of the metastatic 
colorectal cancer family, and metastatic colorectal cancer proteins from other organisms, 
which are cloned and expressed as outlined below. Thus, probe or degenerate polymerase 
chain reaction (PCR) primer sequences may be used to find other related metastatic colorectal 
cancer proteins from primates or other organisms. As will be appreciated by those in the art, 
particularly useful probe and/or PCR primer sequences include unique areas of the metastatic 
colorectal cancer nucleic acid sequence. As is generally known in the art, preferred PCR 
primers are from about 15 to about 35 nucleotides in length, with from about 20 to about 30 
being preferred, and may contain inosine as neieded. PCR reaction conditions are well known 
in the art (e.g., Innis, PCR Protocols, supra). 

Antibodies to metastatic colorectal cancer proteins 

In a preferred embodiment, when a metastatic colorectal cancer protein is to 
be used to generate antibodies, e.g., for immunotherapy or immunodiagnosis, the metastatic 
colorectal cancer protein should share at least one epitope or determinant with the full length 
protein. By "epitope" or "determinant" herein is typically meant a portion of a protein which 
will generate and/or bind an antibody or T-cell receptor in the context of MHC. Thus, in 
most instances, antibodies made to a smaller metastatic colorectal cancer protein will be able 
to bind to the full-length protein, particularly linear epitopes. In a preferred embodiment, the 
epitope is unique; that is, antibodies generated to a unique epitope show little or no cross- 
reactivity. 

Methods of preparing polyclonal antibodies are well known (e.g., Coligan, 
supra; and Harlow & Lane, supra). Polyclonal antibodies can be raised in a mammal, e.g., 
by one or more injections of an immunizing agent and, if desired, an adjuvant. Typically, the 
immunizing agent and/or adjuvant will be injected in the mammal by multiple subcutaneous 
or intraperitoneal injections. The immunizing agent may include a protein encoded by a 
nucleic acid of Tables 1-26 or fragment thereof or a fusion protein thereof. It may be useful 
to conjugate the immunizing agent to a protein known to be immunogenic in the mammal 
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being immunized. Immunogenic proteins include, e.g., keyhole limpet hemocyanin, serum 
albumin, bovine thyroglobulin, and soybean trypsin inhibitor. Adjuvants include, e.g., 
Freund's complete adjuvant and MPL-TDM adjuvant (monophosphoryl Lipid A, synthetic 
trehalose dicorynomycolate). The immunization protocol may be selected by one skilled in 
the art. 

The antibodies may, alternatively, be monoclonal antibodies. Monoclonal 
antibodies may be prepared using hybridoma methods, such as those described by Kohler & 
Milstein, Nature 256:495 (1975). In a hybridoma method, a mouse, hamster, or other 
appropriate host animal, is typically immunized with an immunizing agent to elicit 
lymphocytes that produce or are capable of producing antibodies that will specifically bind to 
the immunizing agent. Alternatively, the lymphocytes may be immunized in vitro. The 
immunizing agent will typically include a polypeptide encoded by a nucleic acid of Tables 1- 
26, or fragment thereof, or a fusion protein thereof. Generally, either peripheral blood 
lymphocytes (*TBLs") are used if cells of human origin are desired, or spleen cells or lymph 
node cells are used if non-human mammalian sources are desired. The lymphocytes are then 
fused with an immortalized cell line using a suitable fusing agent, such as polyethylene 
glycol, to form a hybridoma cell (Goding, Monoclonal Antibodies: Principles and Practice, 
pp. 59-103 (1986)). Immortalized cell lines are usually transformed mammalian cells, 
particularly myeloma cells of rodent, bovine and primate origin. Usually, rat or mouse 
myeloma cell lines are employed. The hybridoma cells may be cultured in a suitable culture 
medium that preferably contains one or more substances that inhibit the growth or survival of 
the unfused, immortalized cells. For example, if the parental cells lack the enzyme 
hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), the culture medium 
for the hybridomas typically will include hypoxanthine, aminopterin, and thymidine ("HAT 
medium"), which substances prevent the growth of HGPRT-deficient cells. 

In one embodiment, the antibodies are bispecific antibodies. Bispecific 
antibodies are typically monoclonal, preferably human or humanized, antibodies that have 
binding specificities for at least two different antigens or that have binding specificities for 
two epitopes on the same antigen. In one embodiment, one of the binding specificities is for 
a protein encoded by a nucleic acid of Tables 1-26 or a fragment thereof the other one is for 
any other antigen, and preferably for a cell-surface protein or receptor or receptor subunit, 
preferably one that is tumor specific. Alternatively, tetramer-type technology may create 
multivalent reagents. 
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In a preferred embodiment, the antibodies to metastatic colorectal cancer 
protein are capable of reducing or eliminating a biological function of a metastatic colorectal 
cancer protein, as is described below. That is, the addition of anti-metastatic colorectal 
cancer protein antibodies (either polyclonal or preferably monoclonal) to metastatic 
colorectal cancer tissue (or cells containing metastatic colorectal cancer) may reduce or 
eliminate the metastatic colorectal cancer. Generally, at least a 25% decrease in activity, 
growth, size or the like is preferred, with at least about 50% being particularly preferred and 
about a 95-100% decrease being especially preferred. 

In a preferred embodiment the antibodies to the metastatic colorectal cancer 
proteins are humanized antibodies (e.g., Xenerex Biosciences, Mederex, Inc., Abgenix, Inc., 
Protein Design Labs, Inc.) Humanized forms of non-human (e.g., murine) antibodies are 
chimeric molecules of immunoglobulins, immunoglobulin chains or fragments thereof (such 
as Fv, Fab, Fab', F(ab')2 or other antigen-binding subsequences of antibodies) which contain 
minimal sequence derived from non-human immunoglobulin. Humanized antibodies include 
human immunoglobulins (recipient antibody) in which residues from a complementary 
determining region (CDR) of the recipient are replaced by residues from a CDR of a non- 
human species (donor antibody) such as mouse, rat or rabbit having the desired specificity, 
affinity and capacity. In some instances, Fv framework residues of the human 
immunoglobulin are replaced by corresponding non-human residues. Humanized antibodies 
may also comprise residues which are found neither in the recipient antibody nor in die 
imported CDR or framework sequences. In general, a humanized antibody will comprise 
substantially all of at least one, and typically two, variable domains, in which all or 
substantially all of the CDR regions correspond to those of a non-human immunoglobulin 
and all or substantially all of the framework (FR) regions are those of a human 
immunoglobulin consensus sequence. The humanized antibody optimally also will comprise 
at least a portion of an immunoglobulin constant region (Fc), typically that of a human 
immunoglobulin (Jones et aL> Nature 321 :522-525 (1986); Riechmann et aL, Nature 
332:323-329 (1988); and Presta, Curr. Op. Struct. Biol 2:593-596 (1992)). Humanization 
can be essentially performed following the method of Winter and co-workers (Jones et aL, 
Nature 321:522-525 (1986); Riechmann et aL, Nature 332:323-327 (1988); Verhoeyen et al. 9 
Science 239:1534-1536 (1988)), by substituting rodent CDRs or CDR sequences for the 
corresponding sequences of a human antibody. Accordingly, such humanized antibodies are 
chimeric antibodies (U.S. Patent No. 4,816,567), wherein substantially less than an intact 
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human variable domain has been substituted by the corresponding sequence from a non- 
human species. 

Human-like antibodies can also be produced using various techniques known 
in the art, including phage display libraries (Hoogenboom & Winter, J. Mol Biol 227:381 
(1991); Maries etal.,J.Mol.BioL 222:581 (1991)). The techniques of Cole et al and 
Boerner et al are also available for the preparation of human monoclonal antibodies (Cole et 
al, Monoclonal Antibodies and Cancer Therapy, p. 77 (1985) and Boerner et al, J. Immunol 
147(l):86-95 (1991)). Similarly, human antibodies can be made by introducing of human 
immunoglobulin loci into transgenic animals, e.g., mice in which the endogenous 
immunoglobulin genes have been partially or completely inactivated. Upon challenge, 
human antibody production is observed, which closely resembles that seen in humans in 
virtually all respects, including gene rearrangement, assembly, and antibody repertoire. This 
approach is described, e.g., in U.S. Patent Nos. 5,545,807; 5,545,806; 5,569,825; 5,625,126; 
5,633,425; 5,661,016, and in the following scientific publications: Marks et al 9 
Bio/Technology 10:779-783 (1992); Lonberg et al, Nature 368:856-859 (1994); Morrison, 
Nature 368:812-13 (1994); Fishwild etal 9 Nature Biotechnology 14:845-51 (1996); 
Neuberger, Nature Biotechnology 14:826 (1996); Lonberg & Huszar, Intern. Rev. Immunol 
13:65-93 (1995). 

By immunotherapy is meant treatment of metastatic colorectal cancer with an 
antibody raised against a metastatic colorectal cancer proteins. As used herein, 
immunotherapy can be passive or active. Passive immunotherapy as defined herein is the 
passive transfer of antibody to a recipient (patient). Active immunization is the induction of 
antibody and/or T-cell responses in a recipient (patient). Induction of an immune response is 
the result of providing the recipient with an antigen to which antibodies are raised. The 
antigen may be provided by injecting a polypeptide against which antibodies are desired to be 
raised into a recipient, or contacting the recipient with a nucleic acid capable of expressing 
the antigen and under conditions for expression of the antigen, leading to an immune 
response. 

In a preferred embodiment the metastatic colorectal cancer proteins against 
which antibodies are raised are secreted proteins as described above. Without being bound 
by theory, antibodies used for treatment, bind and prevent the secreted protein from binding 
to its receptor, thereby inactivating the secreted metastatic colorectal cancer protein. 

In another preferred embodiment, the metastatic colorectal cancer protein to 
which antibodies are raised is a transmembrane protein. Without being bound by theory, 
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antibodies used for this treatment typically bind the extracellular domain of the metastatic 
colorectal cancer protein and prevent it from binding to other proteins, such as circulating 
ligands or cell-associated molecules. The antibody may cause down-regulation of the 
transmembrane metastatic colorectal cancer protein. The antibody may be a competitive, 
non-competitive or uncompetitive inhibitor of protein binding to the extracellular domain of 
the metastatic colorectal cancer protein. The antibody may be an antagonist of the metastatic 
colorectal cancer protein or may prevent activation of the transmembrane metastatic 
colorectal cancer protein. In some embodiments, when the antibody prevents the binding of 
other molecules to the metastatic colorectal cancer protein, the antibody prevents growth of 
the cell. The antibody may also be used to target or sensitize the cell to cytotoxic agents, 
including, but not limited to TNF-a, TNF-P, IL-1, INF-y and EL-2, or chemotherapeutic 
agents including SFU, vinblastine, actinomycin D, cisplatin, methotrexate, and the like. In 
some instances the antibody belongs to a sub-type that activates serum complement when 
complexed with the transmembrane protein thereby mediating cytotoxicity or antigen- 
dependent cytotoxicity (ADCC). Thus, metastatic colorectal cancer is treated by 
administering to a patient antibodies directed against the transmembrane metastatic colorectal 
cancer protein. Antibody-labeling may activate a co-toxin, localize a toxin payload, or 
otherwise provide means to locally ablate cells. 

In another preferred embodiment, the antibody is conjugated to an effector 
moiety. The effector moiety can be any number of molecules, including labeling moieties 
such as radioactive labels or fluorescent labels, or can be a therapeutic moiety. In one aspect 
the therapeutic moiety is a small molecule that modulates the activity of the metastatic 
colorectal cancer protein. In another aspect the therapeutic moiety modulates the activity of 
molecules associated with or in close proximity to the metastatic colorectal cancer protein. 
The therapeutic moiety may inhibit enzymatic activity such as protease or collagenase 
activity associated with metastatic colorectal cancer. 

In a preferred embodiment, the therapeutic moiety can also be a cytotoxic 
agent. In this method, targeting the cytotoxic agent to metastatic colorectal cancer tissue or 
cells results in a reduction in the number of afflicted cells, thereby reducing symptoms 
associated with metastatic colorectal cancer. Cytotoxic agents are numerous and varied and 
include, but are not limited to, cytotoxic drugs or toxins or active fragments of such toxins. 
Suitable toxins and their corresponding fragments include diphtheria A chain, exotoxin A 
chain, ricin A chain, abrin A chain, curcin, crotin, phenomycin, enomycin and the like. 
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Cytotoxic agents also include radiochemicals made by conjugating radioisotopes to 
antibodies raised against metastatic colorectal cancer proteins, or binding of a radionuclide to 
a chelating agent that has been covalently attached to the antibody. Targeting the therapeutic 
moiety to transmembrane metastatic colorectal cancer proteins not only serves to increase the 
local concentration of therapeutic moiety in the metastatic colorectal cancer afflicted area, but 
also serves to reduce deleterious side effects that may be associated with the therapeutic 
moiety. 

In another preferred embodiment, the metastatic colorectal cancer protein 
against which the antibodies are raised is an intracellular protein. In this case, the antibody 
may be conjugated to a protein or other entity which facilitates entry into the cell. In one 
case, the antibody enters the cell by endocytosis. In another embodiment, a nucleic acid 
encoding the antibody is administered to the individual or cell. Moreover, wherein the 
metastatic colorectal cancer protein can be targeted within a cell, i.e., the nucleus, an 
antibody thereto contains a signal for that target localization, i.e., a nuclear localization 
signal. 

The metastatic colorectal cancer antibodies of the invention specifically bind 
to metastatic colorectal cancer proteins. By "specifically bind" herein is meant that the 
antibodies bind to the protein with a Kd of at least about 0.1 mM, more usually at least about 
1 pM, preferably at least about 0.1 jxM or better, and most preferably, 0.01 \M or better. 
Selectivity of binding is also important. 

Detection of metastatic colorectal cancer sequence for diagnostic and therapeutic 
applications 

In one aspect, the RNA expression levels of genes are determined for different 
cellular states in the metastatic colorectal cancer phenotype. Expression levels of genes in 
normal tissue (i.e., not undergoing metastatic colorectal cancer) and in metastatic colorectal 
cancer tissue (and in some cases, for varying severities of metastatic colorectal cancer that 
relate to prognosis, as outlined below) are evaluated to provide expression profiles. An 
expression profile of a particular cell state or point of development is essentially a 
"fingerprint" of the state. While two states may have any particular gene similarly expressed, 
the evaluation of a number of genes simultaneously allows the generation of a gene 
expression profile that is reflective of the state of the cell. By comparing expression profiles 
of cells in different states, information regarding which genes are important (including both 
up* and down-regulation of genes) in each of these states is obtained. Then, diagnosis may 
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be performed or confirmed to determine whether a tissue sample has the gene expression 
profile of normal or cancerous tissue. This will provide for molecular diagnosis of related 
conditions. 

'Differential expression," or grammatical equivalents as used herein, refers to 
qualitative or quantitative differences in the temporal and/or cellular gene expression 
patterns within and among cells and tissue. Thus, a differentially expressed gene can 
qualitatively have its expression altered, including an activation or inactivation, in, e.g., 
normal versus metastatic colorectal cancer tissue. Genes may be turned on or turned off in a 
particular state, relative to another state thus permitting comparison of two or more states. A 
qualitatively regulated gene will exhibit an expression pattern within a state or cell type 
which is detectable by standard techniques. Some genes will be expressed in one state or cell 
type, but not in both. Alternatively, the difference in expression may be quantitative, e.g., in 
that expression is increased or decreased; i.e., gene expression is either upregulated, resulting 
in an increased amount of transcript, or downregulated, resulting in a decreased amount of 
transcript. The degree to which expression differs need only be large enough to quantify via 
standard characterization techniques as outlined below, such as by use of Affymetrix 
GeneChip™ expression arrays, Lockhart, Nature Biotechnology 14:1675-1680 (1996), 
hereby expressly incorporated by reference. Other techniques include, but are not limited to, 
quantitative reverse transcriptase PCR, northern analysis and RNase protection. As outlined 
above, preferably the change in expression (i.e., upregulation or downregulation) is typically 
at least about 50%, more preferably at least about 100%, more preferably at least about 
150%, more preferably at least about 200%, with from 300 to at least 1000% being especially 
preferred. 

Evaluation may be at the gene transcript, or the protein level. The amount of 
gene expression may be monitored using nucleic acid probes to the DNA or RNA equivalent 
of the gene transcript, and the quantification of gene expression levels, or, alternatively, the 
final gene product itself (protein) can be monitored, e.g., with antibodies to the metastatic 
colorectal cancer protein and standard immunoassays (ELISAs, etc.) or other techniques, 
including mass spectroscopy assays, 2D gel electrophoresis assays, etc. Proteins 
corresponding to metastatic colorectal cancer genes, i.e., those identified as being important 
in a metastatic colorectal cancer phenotype, can be evaluated in a metastatic colorectal cancer 
diagnostic test. 

In a preferred embodiment, gene expression monitoring is performed 
simultaneously on a number of genes. 
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The metastatic colorectal cancer nucleic acid probes may be attached to 
biochips as outlined herein for the detection and quantification of metastatic colorectal cancer 
sequences in a particular cell. The assays are further described below in the example. PGR 
techniques can be used to provide greater sensitivity. Multiple protein expression monitoring 
can be performed as well. Similarly, these assays may be performed on an individual basis as 
well. 

hi a preferred embodiment nucleic acids encoding the metastatic colorectal 
cancer protein are detected. Although DNA or RNA encoding the metastatic colorectal 
cancer protein may be detected, of particular interest are methods wherein an mRNA 
encoding a metastatic colorectal cancer protein is detected. Probes to detect mRNA can be a 
nucleotide/deoxynucleotide probe that is complementary to and hybridizes with the mRNA 
and includes, but is not limited to, oligonucleotides, cDNA or RNA. Probes also should 
contain a detectable label, as defined herein. In one method the mRNA is detected after 
immobilizing the nucleic acid to be examined on a solid support such as nylon membranes 
and hybridizing the probe with the sample. Following washing to remove the non- 
specifically bound probe, the label is detected. In another method detection of the mRNA is 
performed in situ. In this method permeabilized cells or tissue samples are contacted with a 
detectably labeled nucleic acid probe for sufficient time to allow the probe to hybridize with 
the target mRNA. Following washing to remove the non-specifically bound probe, the label 
is detected. For example a digoxygenin labeled riboprobe (RNA probe) that is 
complementary to the mRNA encoding a metastatic colorectal cancer protein is detected by 
binding the digoxygenin with an anti-digoxygenin secondary antibody and developed with 
nitro blue tetrazolium and 5-bromo-4-chloro-3-indoyl phosphate. 

In a preferred embodiment, various proteins from the three classes of proteins 
as described herein (secreted, transmembrane or intracellular proteins) are used in diagnostic 
assays. The metastatic colorectal cancer proteins, antibodies, nucleic acids, modified proteins 
and cells containing metastatic colorectal cancer sequences are used in diagnostic assays. 
This can be performed on an individual gene or corresponding polypeptide level. In a 
preferred embodiment, the expression profiles are used, preferably in conjunction with high 
throughput screening techniques to allow monitoring for expression profile genes and/or 
corresponding polypeptides. 

As described and defined herein, metastatic colorectal cancer proteins, 
including intracellular, transmembrane or secreted proteins, find use as markers of metastatic 
colorectal cancer. Detection of these proteins in putative metastatic colorectal cancer tissue 
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allows for detection or diagnosis of metastatic colorectal cancer. In one embodiment, 
antibodies are used to detect metastatic colorectal cancer proteins. A preferred method 
separates proteins from a sample by electrophoresis on a gel (typically a denaturing and 
reducing protein gel, but may be another type of gel, including isoelectric focusing gels and 
the like). Following separation of proteins, the metastatic colorectal cancer protein is 
detected, e.g., by immunoblotting with antibodies raised against the metastatic colorectal 
cancer protein. Methods of immunoblotting are well known to those of ordinary skill in the 
art. 

In another preferred method, antibodies to the metastatic colorectal cancer 
protein find use in in situ imaging techniques, e.g., in histology (e.g., Methods in Cell 
Biology: Antibodies in Cell Biology, volume 37 (Asai, ed. 1993)). In this method cells are 
contacted with from one to many antibodies to the metastatic colorectal cancer protein(s). 
Following washing to remove non-specific antibody binding, the presence of the antibody or 
antibodies is detected. In one embodiment the antibody is detected by incubating with a 
secondary antibody that contains a detectable label, e.g., multicolor fluorescence or confocal 
imaging. In another method the primary antibody to the metastatic colorectal cancer 
protein(s) contains a detectable label, e.g., an enzyme marker that can act on a substrate. In 
another preferred embodiment each one of multiple primary antibodies contains a distinct 
and detectable label. This method finds particular use in simultaneous screening for a 
plurality of metastatic colorectal cancer proteins. Many other histological imaging 
techniques are also provided by the invention. 

In a preferred embodiment the label is detected in a fluorometer which has the 
ability to detect and distinguish emissions of different wavelengths. In addition, a 
fluorescence activated cell sorter (FACS) can be used in the method. 

In another preferred embodiment, antibodies find use in diagnosing metastatic 
colorectal cancer from blood, serum, plasma, stool, and other samples. Such samples, 
therefore, are useful as samples to be probed or tested for the presence of metastatic 
colorectal cancer proteins. Antibodies can be used to detect a metastatic colorectal cancer 
protein by previously described immunoassay techniques including ELISA, immunoblotting 
(western blotting), immunoprecipitation, BIACORE technology and the like. Conversely, the 
presence of antibodies may indicate an immune response against an endogenous metastatic 
colorectal cancer protein or vaccine. 

In a preferred embodiment, in situ hybridization of labeled metastatic 
colorectal cancer nucleic acid probes to tissue arrays is done. For example, arrays of tissue 
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samples, including metastatic colorectal cancer tissue and/or normal tissue, are made. In situ 
hybridization (see, e.g., Ausubel, supra) is then performed When comparing the fingerprints 
between an individual and a standard, the skilled artisan can make a diagnosis, a prognosis, or 
a prediction based on the findings. It is further understood that the genes which indicate the 
diagnosis may differ from those which indicate the prognosis and molecular profiling of the 
condition of the cells may lead to distinctions between responsive or refractory conditions or 
may be predictive of outcomes. 

In a preferred embodiment, the metastatic colorectal cancer proteins, 
antibodies, nucleic acids, modified proteins and cells containing metastatic colorectal cancer 
sequences are used in prognosis assays. As above, gene expression profiles can be generated 
that correlate to metastatic colorectal cancer, in terms of long term prognosis. Again, this 
may be done on either a protein or gene level, with the use of genes being preferred. As 
above, metastatic colorectal cancer probes may be attached to biochips for the detection and 
quantification of metastatic colorectal cancer sequences in a tissue or patient The assays 
proceed as outlined above for diagnosis. PGR method may provide more sensitive and 
accurate quantification. 

Assays for therapeutic compounds 

In a preferred embodiment members of the three classes of proteins as 
described herein are used in drug screening assays. The metastatic colorectal cancer proteins, 
antibodies, nucleic acids, modified proteins and cells containing metastatic colorectal cancer 
sequences are used in drug screening assays or by evaluating the effect of drug candidates on 
a "gene expression profile" or expression profile of polypeptides. In a preferred embodiment, 
the expression profiles are used, preferably in conjunction with high throughput screening 
techniques to allow monitoring for expression profile genes after treatment with a candidate 
agent (e.g., Zlokarnik, et al, Science 279:84-8 (1998); Heid, Genome Res 6:986-94, 1996). 

In a preferred embodiment, the metastatic colorectal cancer proteins, 
antibodies, nucleic acids, modified proteins and cells containing the native or modified 
metastatic colorectal cancer proteins are used in screening assays. That is, the present 
invention provides novel methods for screening for compositions which modulate the 
metastatic colorectal cancer phenotype or an identified physiological function of a metastatic 
colorectal cancer protein. As above, this can be done on an individual gene level or by 
evaluating the effect of drug candidates on a "gene expression profile". In a preferred 
embodiment, the expression profiles are used, preferably in conjunction with high throughput 
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screening techniques to allow monitoring for expression profile genes after treatment with a 
candidate agent, see Zlokarnik, supra. 

Having identified the differentially expressed genes herein, a variety of assays 
may be applied In a preferred embodiment, assays may be run on an individual gene or 
protein level. That is, having identified a particular gene with altered regulation in metastatic 
colorectal cancer, test compounds can be screened for the ability to modulate gene expression 
or for binding to the metastatic colorectal cancer protein. "Modulation" thus includes an 
increase or a decrease in gene expression. The preferred amount of modulation will depend 
on the original change of the gene expression in normal versus tissue undergoing metastatic 
colorectal cancer, with changes of at least 10%, preferably 50%, more preferably 100-300%, 
and in some embodiments 300-1000% or greater. Thus, if a gene exhibits a 4-fold increase in 
metastatic colorectal cancer tissue compared to normal tissue, a decrease of about four-fold is 
often desired; similarly, a 10-fold decrease in metastatic colorectal cancer tissue compared to 
normal tissue often provides a target value of a 10-fold increase in expression to be induced 
by the test compound. 

The amount of gene expression may be monitored using nucleic acid probes 
and the quantification of gene expression levels, or, alternatively, the gene product itself can 
be monitored, e.g., through the use of antibodies to the metastatic colorectal cancer protein 
and standard immunoassays. Proteomics and separation techniques may also allow 
quantification of expression. 

In a preferred embodiment, gene or protein expression monitoring of a number 
of entities, i.e., an expression profile, is monitored simultaneously. Such profiles will 
typically involve a plurality of those entities described herein. 

In this embodiment, the metastatic colorectal cancer nucleic acid probes are 
attached to biochips as outlined herein for the detection and quantification of metastatic 
colorectal cancer sequences in a particular cell. Alternatively, PCR may be used Thus, a 
series, e.g., of microtiter plate, may be used with dispensed primers in desired wells. A PCR 
reaction can then be performed and analyzed for each well. 

Expression monitoring can be performed to identify compounds that modify 
the expression of one or more metastatic colorectal cancer-associated sequences, e.g., a 
polynucleotide sequence set out in Tables 1-26. Generally, in a preferred embodiment, a test 
compound is added to the cells prior to analysis. Moreover, screens are also provided to 
identify agents that modulate metastatic colorectal cancer, modulate metastatic colorectal 
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cancer proteins, bind to a metastatic colorectal cancer protein, or interfere with the binding of 
a metastatic colorectal cancer protein and an antibody, substrate, or other binding partner. 

The term "test compound" or "drug candidate" or "modulator" or grammatical 
equivalents as used herein describes any molecule, e.g., protein, oligopeptide, small organic 
molecule, polysaccharide, polynucleotide, etc., to be tested for the capacity to directly or 
indirectly alter the metastatic colorectal cancer phenotype or the expression of a metastatic 
colorectal cancer sequence, e.g., a nucleic acid or protein sequence. In preferred 
embodiments, modulators alter expression profiles of nucleic acids or proteins provided 
herein. In one embodiment, the modulator suppresses a metastatic colorectal cancer 
phenotype, e.g., to a normal tissue fingerprint. In another embodiment, a modulator induces a 
metastatic colorectal cancer phenotype. Generally, a plurality of assay mixtures are run in 
parallel with different agent concentrations to obtain a differential response to the various 
concentrations. Typically, one of these concentrations serves as a negative control, i.e., at 
zero concentration or below the level of detection. 

In one aspect, a modulator will neutralize the effect of a metastatic colorectal 
cancer protein. By "neutralize" is meant that activity of a protein and the consequent effect 
on the cell is inhibited or blocked. 

In certain embodiments, combinatorial libraries of potential modulators will be 
screened for an ability to bind to a metastatic colorectal cancer polypeptide or to modulate 
activity. Conventionally, new chemical entities with useful properties are generated by 
identifying a chemical compound (called a "lead compound") with some desirable property 
or activity, e.g., inhibiting activity, creating variants of the lead compound, and evaluating the 
property and activity of those variant compounds. Often, high throughput screening (HTS) 
methods are employed for such an analysis. 

In one preferred embodiment, high throughput screening methods involve 
providing a library containing a large number of potential therapeutic compounds (candidate 
compounds). Such "combinatorial chemical libraries" are then screened in one or more 
assays to identify those library members (particular chemical species or subclasses) that 
display a desired characteristic activity. The compounds thus identified can serve as 
conventional "lead compounds" or can themselves be used as potential or actual therapeutics. 

A combinatorial chemical library is a collection of diverse chemical 
compounds generated by either chemical synthesis or biological synthesis by combining a 
number of chemical "building blocks" such as reagents. For example, a linear combinatorial 
chemical library, such as a polypeptide (e.g., mutein) library, is formed by combining a set of 
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chemical building blocks called amino acids in every possible way for a given compound 
length (i.e., the number of amino acids in a polypeptide compound). Millions of chemical 
compounds can be synthesized through such combinatorial mixing of chemical building 
blocks (Gallop etal, J. Med. Chem. 37(9):1233-1251 (1994)). 

Preparation and screening of combinatorial chemical libraries is well known to 
those of skill in the ait Such combinatorial chemical libraries include, but are not limited to, 
peptide libraries (see, e.g., U.S. Patent No. 5,010,175, Fuika, Pept Prot. Res. 37:487-493 
(1991), Houghton et al, Nature, 354:84-88 (1991)), peptoids (PCT Publication No WO 
91/19735), encoded peptides (PCT Publication WO 93/20242), random bio-oligomers (PCT 
Publication WO 92/00091), benzodiazepines (U.S. Pat. No. 5,288,514), diversomers such as 
hydantoins, benzodiazepines and dipeptides (Hobbs et al, Proc. Nat Acad Set USA 
90:6909-6913 (1993)), vinylogous polypeptides (Hagihara et al 9 J. Amer. Chem. Soc. 
1 14:6568 (1992)), nonpeptidal peptidomimetics with a Beta-D-Glucose scaffolding 
(Hirschmann et al 9 J* Amer. Chem. Soc. 114:9217-9218 (1992)), analogous organic syntheses 
of small compound libraries (Chen et al 9 J. Amer. Chem. Soc. 1 16:2661 (1994)), 
oligocarbamates (Cho, et ai 9 Science 261 : 1303 (1993)), and/or peptidyl phosphonates 
(Campbell et al, J. Org. Chem. 59:658 (1994)). See, generally, Gordon et al, J. Med. Chem. 
37:1385 (1994), nucleic acid libraries (see, e.g., Strategene, Corp.), peptide nucleic acid 
libraries (see, e.g., U.S. Patent 5,539,083), antibody libraries (see, e.g., Vaughn et al, Nature 
Biotechnology 14(3):309-314 (1996), and PCT/US96/10287), carbohydrate libraries (see, 
e.g., Liang et al., Science 274:1520-1522 (1996), and U.S. Patent No. 5,593,853), and small 
organic molecule libraries (see, e.g., benzodiazepines, Baum, C&EN, Jan 18, page 33 (1993); 
isoprenoids, U.S. Patent No. 5,569,588; thiazolidinones and metathiazanones, U.S. Patent No. 
5,549,974; pyrrolidines, U.S. Patent Nos. 5,525,735 and 5,519,134; morpholino compounds, 
U.S. Patent No. 5,506,337; benzodiazepines, U.S. Patent No. 5,288,514; and the like). 

Devices for the preparation of combinatorial libraries are commercially 
available (see, e.g„ 357 MPS, 390 MPS, Advanced Chem Tech, Louisville KY, Symphony, 
Rainin, Woburn, MA, 433 A Applied Biosystems, Foster City, CA, 9050 Plus, Millipore, 
Bedford, MA). 

A number of well known robotic systems have also been developed for 
solution phase chemistries. These systems include automated workstations like the 
automated synthesis apparatus developed by Takeda Chemical Industries, LTD. (Osaka, 
Japan) and many robotic systems utilizing robotic arms (Zymate II, Zymark Corporation, 
Hopkinton, Mass.; Orca, Hewlett-Packard, Palo Alto, Calif.), which mimic the manual 
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synthetic operations performed by a chemist. The above devices, with appropriate 
modification, are suitable for use with the present invention. In addition, numerous 
combinatorial libraries are themselves commercially available (see, e.g M ComGenex, 
Princeton, NJ., Asinex, Moscow, Ru, Tripos, Inc., St Louis, MO, ChemStar, Ltd, Moscow, 
RU, 3D Pharmaceuticals, Exton, PA, Martek Biosciences, Columbia, MD, etc.). 

The assays to identify modulators are amenable to high throughput screening. 
Preferred assays thus detect modulation of metastatic colorectal cancer gene transcription, 
polypeptide expression, and polypeptide activity. 

High throughput assays for evaluating the presence, absence, quantification, or 
other properties of particular nucleic acids or protein products are well known to those of 
skill in the art Similarly, binding assays and reporter gene assays are similarly well known. 
Thus, e.g., U.S. Patent No. 5,559,410 discloses high throughput screening methods for 
proteins, U.S. Patent No. 5,585,639 discloses high throughput screening methods for nucleic 
acid binding (i.e., in arrays), while U.S. Patent Nos. 5,576,220 and 5,541,061 disclose high 
throughput methods of screening for ligand/antibody binding. 

In addition, high throughput screening systems are commercially available 
(see, e.g., Zymark Corp., Hopkinton, MA; Air Technical Industries, Mentor, OH; Beckman 
Instruments, Inc. Fullerton, CA; Precision Systems, Inc., Natick, MA, etc.). These systems 
typically automate procedures, including sample and reagent pipetting, liquid dispensing, 
timed incubations, and final readings of the microplate in detectors) appropriate for the 
assay. These configurable systems provide high throughput and rapid start up as well as a 
high degree of flexibility and customization. The manufacturers of such systems provide 
detailed protocols for various high throughput systems. Thus, e.g., Zymark Corp. provides 
technical bulletins describing screening systems for detecting the modulation of gene 
transcription, ligand binding, and the like. 

In one embodiment, modulators are proteins, often naturally occurring 
proteins or fragments of naturally occurring proteins. Thus, e.g., cellular extracts containing 
proteins, or random or directed digests of proteinaceous cellular extracts, may be used, hi 
this way libraries of proteins may be made for screening in the methods of the invention. 
Particularly preferred in this embodiment are libraries of bacterial, fungal, viral, and 
mammalian proteins, with the latter being preferred, and human proteins being especially 
preferred. Particularly useful test compound will be directed to the class of proteins to which 
the target belongs, e.g., substrates for enzymes or ligands and receptors. 



58 



WO 02/068677 



PCT/US02/06001 



Li a preferred embodiment, modulators are peptides of from about 5 to about 
30 amino acids, with from about 5 to about 20 amino acids being preferred, and from about 7 
to about 15 being particularly preferred. The peptides may be digests of naturally occurring 
proteins as is outlined above, random peptides, or "biased" random peptides. By 
"randomized" or grammatical equivalents herein is meant that the nucleic acid or peptide 
consists of essentially random sequences of nucleotides and amino acids, respectively. Since 
these random peptides (or nucleic acids, discussed below) are often chemically synthesized, 
they may incorporate any nucleotide or amino acid at any position. The synthetic process can 
be designed to generate randomized proteins or nucleic acids, to allow the formation of all or 
most of the possible combinations over the length of the sequence, thus forming a library of 
randomized candidate bioactive proteinaceous agents. 

In one embodiment, the library is fully randomized, with no sequence 
preferences or constants at any position. In a preferred embodiment, the library is biased. 
That is, some positions within the sequence are either held constant, or are selected from a 
limited number of possibilities. In a preferred embodiment, the nucleotides or amino acid 
residues are randomized within a defined class, e.g., of hydrophobic amino acids, hydrophilic 
residues, sterically biased (either small or large) residues, towards the creation of nucleic acid 
binding domains, the creation of cysteines, for cross-linking, prolines for SH-3 domains, 
serines, threonines, tyrosines or histidines for phosphorylation sites, etc. 

Modulators of metastatic colorectal cancer can also be nucleic acids, as 

defined above. 

As described above generally for proteins, nucleic acid modulating agents may 
be naturally occurring nucleic acids, random nucleic acids, or "biased" random nucleic acids. 
Digests of procaryotic or eucaryotic genomes may be used as is outlined above for proteins. 

In a preferred embodiment, the candidate compounds are organic chemical 
moieties, a wide variety of which are available in the literature. 

After a candidate agent has been added and the cells allowed to incubate for 
some period of time, the sample containing a target sequence is analyzed. If required, the 
target sequence is prepared using known techniques. For example, the sample may be treated 
to lyse the cells, using known lysis buffers, electroporation, etc., with purification and/or 
amplification such as PCR performed as appropriate. For example, an in vitro transcription 
with labels covalently attached to the nucleotides is performed. Generally, the nucleic acids 
are labeled with biotin-FITC or PE, or with cy3 or cy5. 



59 



WO 02/068677 



PCT/US02/06001 



In a preferred embodiment, the target sequence is labeled with, e.g., a 
fluorescent, a chemiluminescent, a chemical, or a radioactive signal, to provide a means of 
detecting the target sequence's specific binding to a probe. The label also can be an enzyme, 
such as, alkaline phosphatase or horseradish peroxidase, which when provided with an 
appropriate substrate produces a product that can be detected. Alternatively, the label can be 
a labeled compound or small molecule, such as an enzyme inhibitor, that binds but is not 
catalyzed or altered by the enzyme. The label also can be a moiety or compound, such as, an 
epitope tag or biotin which specifically binds to streptavidin. For the example of biotin, the 
streptavidin is labeled as described above, thereby, providing a detectable signal for the 
bound target sequence. Unbound labeled streptavidin is typically removed prior to analysis. 

Nucleic acid assays can be direct hybridization assays or can comprise 
"sandwich assays'*, which include the use of multiple probes, as is generally outlined in U.S. 
Patent Nos. 5,681,702, 5,597,909, 5,545,730, 5,594,117, 5,591,584, 5,571,670, 5,580,731, 
5,571,670, 5,591,584, 5,624,802, 5,635,352, 5,594,118, 5,359,100, 5,124,246 and 5,681,697, 
all of which are hereby incorporated by reference. In this embodiment, in general, the target 
nucleic acid is prepared as outlined above, and then added to the biochip comprising a 
plurality of nucleic acid probes, under conditions that allow the fonnation of a hybridization 
complex. 

A variety of hybridization conditions may be used in the present invention, 
including high, moderate and low stringency conditions as outlined above. The assays are 
generally run under stringency conditions which allow fonnation of the label probe 
hybridization complex only in the presence of target Stringency can be controlled by 
altering a step parameter that is a thermodynamic variable, including, but not limited to, 
temperature, fonnamide concentration, salt concentration, chaotropic salt concentration, pH, 
organic solvent concentration, etc. 

These parameters may also be used to control non-specific binding, as is 
generally outlined in U.S. Patent No. 5,681,697. Thus it may be desirable to perform certain 
steps at higher stringency conditions to reduce non-specific binding. 

The reactions outlined herein may be accomplished in a variety of ways. 
Components of the reaction may be added simultaneously, or sequentially, in different orders, 
with preferred embodiments outlined below. In addition, the reaction may include a variety 
of other reagents. These include salts, buffers, neutral proteins, e.g., albumin, detergents, etc. 
which may be used to facilitate optimal hybridization and detection, and/or reduce non- 
specific or background interactions. Reagents that otherwise improve the efficiency of the 
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assay, such as protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may also be 
used as appropriate, depending on the sample preparation methods and purity of the target. 

The assay data are analyzed to determine the expression levels, and changes in 
expression levels as between states, of individual genes, forming a gene expression profile. 

Screens are performed to identify modulators of the metastatic colorectal 
cancer phenotype. In one embodiment, screening is performed to identify modulators that 
can induce or suppress a particular expression profile, thus preferably generating the 
associated phenotype. In another embodiment, e.g., for diagnostic applications, having 
identified differentially expressed genes important in a particular state, screens can be 
performed to identify modulators that alter expression of individual genes. In an another 
embodiment, screening is performed to identify modulators that alter a biological function of 
the expression product of a differentially expressed gene. Again, having identified the 
importance of a gene in a particular state, screens are performed to identify agents that bind 
and/or modulate the biological activity of the gene product, or evaluate genetic 
polymorphisms. 

Genes can be screened for those that are induced in response to a candidate 
agent. After identifying a modulator based upon its ability to suppress a metastatic colorectal 
cancer expression pattern leading to a normal expression pattern, or to modulate a single 
metastatic colorectal cancer gene expression profile so as to mimic the expression of the gene 
from normal tissue, a screen as described above can be performed to identify genes that are 
specifically modulated in response to the agent Comparing expression profiles between 
normal tissue and agent treated metastatic colorectal cancer tissue reveals genes that are not 
expressed in normal tissue or metastatic colorectal cancer tissue, but are expressed in agent 
treated tissue. These agent-specific sequences can be identified and used by methods 
described herein for metastatic colorectal cancer genes or proteins. In particular these 
sequences and the proteins they encode find use in marking or identifying agent treated cells. 
In addition, antibodies can be raised against the agent induced proteins and used to target 
novel therapeutics to the treated metastatic colorectal cancer tissue sample. 

Thus, in one embodiment, a test compound is administered to a population of 
metastatic colorectal cancer cells, that have an associated metastatic colorectal cancer 
expression profile. By "administration" or "contacting" herein is meant that the candidate 
agent is added to the cells in such a manner as to allow the agent to act upon the cell, whether 
by uptake and intracellular action, or by action at the cell surface. In some embodiments, 
nucleic acid encoding a proteinaceous candidate agent (i.e., a peptide) may be put into a viral 
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construct such as an adenoviral or retroviral construct, and added to the cell, such that 
expression of the peptide agent is accomplished, e.g., PCT US97/01019. Regulatable gene 
therapy systems can also be used. 

Once the test compound has been administered to the cells, the cells can be 
washed if desired and are allowed to incubate under preferably physiological conditions for 
some period of time. The cells are then harvested and a new gene expression profile is 
generated, as outlined herein. 

Thus, e.g., metastatic colorectal cancer tissue may be screened for agents that 
modulate, e.g., induce or suppress the metastatic colorectal cancer phenotype. A change in at 
least one gene, preferably many, of the expression profile indicates that the agent has an 
effect on metastatic colorectal cancer activity. By defining such a signature for the metastatic 
colorectal cancer phenotype, screens for new drugs that alter the phenotype can be devised. 
With this approach, the drug target need not be known and need not be represented in the 
original expression screening platform, nor does the level of transcript for the target protein 
need to change. 

Measure of metastatic colorectal cancer polypeptide activity, or of metastatic 
colorectal cancer or the metastatic colorectal cancer phenotype can be performed using a 
variety of assays. For example, the effects of the test compounds upon the function of the 
metastatic polypeptides can be measured by examining parameters described above. A 
suitable physiological change that affects activity can be used to assess the influence of a test 
compound on the polypeptides of this invention. When the functional consequences are 
determined using intact cells or animals, one can also measure a variety of effects such as, in 
the case of metastatic colorectal cancer associated with tumors, tumor growth, tumor 
metastasis, neovascularization, hormone release, transcriptional changes to both known and 
uncharacterized genetic markers (e.g., northern blots), changes in cell metabolism such as cell 
growth or pH changes, and changes in intracellular second messengers such as cGMP. In the 
assays of the invention, mammalian metastatic colorectal cancer polypeptide is typically 
used, e.g., mouse, preferably human. 

Assays to identify compounds with modulating activity can be performed in 
vitro. For example, a colorectal cancer polypeptide is first contacted with a potential 
modulator and incubated for a suitable amount of time, e.g., from 0.5 to 48 hours. In one 
embodiment, the metastatic colorectal cancer polypeptide levels are determined in vitro by 
measuring the level of protein or mRNA. The level of protein is measured using 
immunoassays such as western blotting, ELISA and the like with an antibody that selectively 
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binds to the metastatic colorectal cancer polypeptide or a fragment thereof. For measurement 
of mRNA, amplification, e.g., using PCR, LCR, or hybridization assays, e.g., northern 
hybridization, RNAse protection, dot blotting, are preferred. The level of protein or mRNA 
is detected using directly or indirectly labeled detection agents, e.g., fluorescently or 
radioactively labeled nucleic acids, radioactively or enzymatically labeled antibodies, and the 
like, as described herein. 

Alternatively, a reporter gene system can be devised using the metastatic 
colorectal cancer protein promoter operably linked to a reporter gene such as luciferase, green 
fluorescent protein, CAT, or (J-gal. The reporter construct is typically transfected into a cell. 
After treatment with a potential modulator, the amount of reporter gene transcription, 
translation, or activity is measured according to standard techniques known to those of skill 
in the art 

In a preferred embodiment, as outlined above, screens may be done on 
individual genes and gene products (proteins). That is, having identified a particular 
differentially expressed gene as important in a particular state, screening of modulators of the 
expression of the gene or the gene product itself can be done. The gene products of 
differentially expressed genes are sometimes referred to herein as "metastatic colorectal 
cancer proteins." The metastatic colorectal cancer protein may be a fragment, or 
alternatively, be the full length protein to a fragment shown herein. 

In one embodiment, screening for modulators of expression of specific genes 
is performed. Typically, the expression of only one or a few genes are evaluated. In another 
embodiment, screens are designed to first find compounds that bind to differentially 
expressed proteins. These compounds are then evaluated for the ability to modulate 
differentially expressed activity. Moreover, once initial candidate compounds are identified, 
variants can be further screened to better evaluate structure activity relationships. 

In a preferred embodiment, binding assays are done. In general, purified or 
isolated gene product is used; that is, the gene products of one or more differentially 
expressed nucleic acids are made. For example, antibodies are generated to the protein gene 
products, and standard immunoassays are run to determine the amount of protein present. 
Alternatively, cells comprising the metastatic colorectal cancer proteins can be used in the 
assays. 

Thus, in a preferred embodiment, the methods comprise combining a 
metastatic colorectal cancer protein and a candidate compound, and determining the binding 
of the compound to the metastatic colorectal cancer protein. Preferred embodiments utilize 
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the human metastatic colorectal cancer protein, although other mammalian proteins may also 
be used, e.g., for the development of animal models of human disease. In some embodiments, 
as outlined herein, variant or derivative metastatic colorectal cancer proteins may be used. 

Generally, in a preferred embodiment of the methods herein, the metastatic 
colorectal cancer protein or the candidate agent is non-diffusably bound to an insoluble 
support having isolated sample receiving areas (e.g., a microliter plate, an array, etc.). The 
insoluble supports may be made of any composition to which the compositions can be bound, 
is readily separated from soluble material, and is otherwise compatible with the overall 
method of screening. The surface of such supports may be solid or porous and of any 
convenient shape. Examples of suitable insoluble supports include microliter plates, arrays, 
membranes and beads. These are typically made of glass, plastic (e.g., polystyrene), 
polysaccharides, nylon or nitrocellulose, teflon™, etc. Microliter plates and arrays are 
especially convenient because a large number of assays can be carried out simultaneously, 
using small amounts of reagents and samples. The particular manner of binding of the 
composition is not crucial so long as it is compatible with the reagents and overall methods of 
the invention, maintains the activity of the composition and is nondifilisable. Preferred 
methods of binding include the use of antibodies (which do not sterically block either the 
ligand binding site or activation sequence when the protein is bound to the support), direct 
binding to "sticky** or ionic supports, chemical crosslinlring, the synthesis of the protein or 
agent on the surface, etc. Following binding of the protein or agent, excess unbound material 
is removed by washing. The sample receiving areas may then be blocked through incubation 
with bovine serum albumin (BSA), casein or other innocuous protein or other moiety. 

In a preferred embodiment, the metastatic colorectal cancer protein is bound to 
the support, and a test compound is added to the assay. Alternatively, the candidate agent is 
bound to the support and the metastatic colorectal cancer protein is added. Novel binding 
agents include specific antibodies, non-natural binding agents identified in screens of 
chemical libraries, peptide analogs, etc. Of particular interest are screening assays for agents 
that have a low toxicity for human cells. A wide variety of assays may be used for this 
purpose, including labeled in vitro protein-protein binding assays, electrophoretic mobility 
shift assays, immunoassays for protein binding, functional assays (phosphorylation assays, 
etc.) and the like. 

The determination of the binding of the test modulating compound to the 
metastatic colorectal cancer protein may be done in a number of ways. In a preferred 
embodiment, the compound is labeled, and binding determined directly, e.g., by attaching all 
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or a portion of the metastatic colorectal cancer protein to a solid support, adding.a labeled 
candidate, agent (e.g., a fluorescent label), washing off excess reagent, and determining 
whether the label is present on the solid support. Various blocking and washing steps may be 
utilized as appropriate. 

In some embodiments, only one of the components is labeled, e.g., the 
proteins (or proteinaceous candidate compounds) can be labeled. Alternatively, more than 
one component can be labeled with different labels, e.g., l25 I for the proteins and a fluorophor 
for the compound. Proximity reagents, e.g., quenching or energy transfer reagents are also 
useful. 

In one embodiment, the binding of the test compound is determined by 
competitive binding assay. The competitor is a binding moiety known to bind to the target 
molecule (i.e., a metastatic colorectal cancer protein), such as an antibody, peptide, binding 
partner, ligand, etc. Under certain circumstances, there may be competitive binding between 
the compound and the binding moiety, with the binding moiety displacing the compound. In 
one embodiment, the test compound is labeled Either the compound, or the competitor, or 
both, is added first to the protein for a time sufficient to allow binding, if present 
Incubations may be performed at a temperature which facilitates optimal activity, typically 
between 4 and 40°C. Incubation periods are typically optimized, e.g., to facilitate rapid high 
throughput screening. Typically between 0.1 and 1 hour will be sufficient. Excess reagent is 
generally removed or washed away. The second component is then added, and the presence 
or absence of the labeled component is followed, to indicate binding. 

In a preferred embodiment, the competitor is added first, followed by the test 
compound. Displacement of the competitor is an indication that the test compound is binding 
to the metastatic colorectal cancer protein and thus is capable of binding to, and potentially 
modulating, the activity of the metastatic colorectal cancer protein. In this embodiment, 
either component can be labeled. Thus, e.g., if the competitor is labeled, the presence of label 
in the wash solution indicates displacement by the agent Alternatively, if the test compound 
is labeled, the presence of the label on the support indicates displacement 

In an alternative embodiment, the test compound is added first, with 
incubation and washing, followed by the competitor. The absence of binding by the 
competitor may indicate that the test compound is bound to the metastatic colorectal cancer 
protein with a higher affinity. Thus, if the test compound is labeled, the presence of the label 
on the support, coupled with a lack of competitor binding, may indicate that the test 
compound is capable of binding to the metastatic colorectal cancer protein. 
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In a preferred embodiment, the methods comprise differential screening to 
identity agents that are capable of modulating the activity of the metastatic colorectal cancer 
proteins. In this embodiment, the methods comprise combining a metastatic colorectal cancer 
protein and a competitor in a first sample. A second sample comprises a test compound, a 
metastatic colorectal cancer protein, and a competitor. The binding of the competitor is 
determined for both samples, and a change, or difference in binding between the two samples 
indicates the presence of an agent capable of binding to the metastatic colorectal cancer 
protein and potentially modulating its activity. That is, if the binding of the competitor is 
different in the second sample relative to the first sample, the agent is capable of binding to 
the metastatic colorectal cancer protein. 

Alternatively, differential screening is used to identify drug candidates that 
bind to the native metastatic colorectal cancer protein, but cannot bind to modified metastatic 
colorectal cancer proteins. The structure of the metastatic colorectal cancer protein may be 
modeled, and used in rational drug design to synthesize agents that interact with that site. 
Drug candidates that affect the activity of a metastatic colorectal cancer protein are also 
identified by screening drugs for the ability to either enhance or reduce the activity of the 
protein. 

Positive controls and negative controls may be used in the assays. Preferably 
control and test samples are performed in at least triplicate to obtain statistically significant 
results. Incubation of all samples is for a time sufficient for the binding of the agent to the 
protein. Following incubation, samples are washed free of non-specifically bound material 
and the amount of bound, generally labeled agent determined. For example, where a 
radiolabel is employed, the samples may be counted in a scintillation counter to determine the 
amount of bound compound. 

A variety of other reagents may be included in the screening assays. These 
include reagents like salts, neutral proteins, e.g., albumin, detergents, etc. which may be used 
to facilitate optimal protein-protein binding and/or reduce non-specific or background 
interactions. Also reagents that otherwise improve the efficiency of the assay, such as 
protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may be used. The mixture 
of components may be added in an order that provides for the requisite binding. 

In a preferred embodiment, the invention provides methods for screening for a 
compound capable of modulating the activity of a metastatic colorectal cancer protein. The 
methods comprise adding a test compound, as defined above, to a cell comprising metastatic 
colorectal cancer proteins. Preferred cell types include almost any cell. The cells contain a 
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recombinant nucleic acid that encodes a metastatic colorectal cancer protein. In a preferred 
embodiment, a library of candidate agents are tested on a plurality of cells. 

In one aspect, the assays are evaluated in the presence or absence or previous 
or subsequent exposure of physiological signals, e.g., hormones, antibodies, peptides, 
antigens, cytokines, growth factors, action potentials, pharmacological agents including 
chemotherapeutics, radiation, carcinogenics, or other cells (i.e. cell-cell contacts). In another 
example, the determinations are determined at different stages of the cell cycle process. 

In this way, compounds that modulate metastatic colorectal cancer agents are 
identified. Compounds with pharmacological activity are able to enhance or interfere with 
the activity of the metastatic colorectal cancer protein. Once identified, similar structures are 
evaluated to identify critical structural feature of the compound. 

In one embodiment, a method of inhibiting metastatic colorectal cancer cell 
division is provided. The method comprises administration of a metastatic colorectal cancer 
inhibitor. In another embodiment, a method of inhibiting metastatic colorectal cancer is 
provided. The method comprises administration of a metastatic colorectal cancer inhibitor. 
In a further embodiment, methods of treating cells or individuals with metastatic colorectal 
cancer are provided. The method comprises administration of a metastatic colorectal cancer 
inhibitor. 

A variety of cell growth, proliferation, and metastasis assays are known to 
those of skill in the art, as described below. 

Soft agar growth or colony formation in suspension 

Normal cells require a solid substrate to attach and grow. When the cells are 
transformed, they lose this phenotype and grow detached from the substrate. For example, 
transformed cells can grow in stirred suspension culture or suspended in semi-solid media, 
such as semi-solid or soft agar. The transformed cells, when transfected with tumor 
suppressor genes, regenerate normal phenotype and require a solid substrate to attach and 
grow. Soft agar growth or colony formation in suspension assays can be used to identify 
modulators of metastatic colorectal cancer sequences, which when expressed in host cells, 
inhibit abnormal cellular proliferation and transformation. A therapeutic compound would 
reduce or eliminate the host cells' ability to grow in stirred suspension culture or suspended 
in semi-solid media, such as semi-solid or soft. 

Techniques for soft agar growth or colony formation in suspension assays are 
described in Freshney, Culture of Animal Cells a Manual of Basic Technique (3 rd eA, 1994), 
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herein incorporated by reference. See also, the methods section of Garkavtsev et al. (1996), 
supra, herein incorporated by reference. 

Contact inhibition and density limitation of growth 

Normal cells typically grow in a flat and organized pattern in a petri dish until 
they touch other cells. When the cells touch one another, they are contact inhibited and stop 
growing. When cells are transformed, however, the cells are not contact inhibited and 
continue to grow to high densities in disorganized foci. Thus, the transformed cells grow to a 
higher saturation density than normal cells. This can be detected morphologically by the 
formation of a disoriented monolayer of cells or rounded cells in foci within the regular 
pattern of normal surrounding cells. Alternatively, labeling index with ( 3 H)-thymidine at 
saturation density can be used to measure density limitation of growth. See Freshney (1994), 
supra. The transformed cells, when transfected with tumor suppressor genes, regenerate a 
normal phenotype and become contact inhibited and would grow to a lower density. 

In this assay, labeling index with (^-thymidine at saturation density is a 
preferred method of measuring density limitation of growth. Transformed host cells are 
transfected with a metastatic colorectal cancer-associated sequence and are grown for 24 
hours at saturation density in non-limiting medium conditions. The percentage of cells 
labeling with ( 3 H)-thymidine is determined autoradiographically. See, Freshney (1994), 
supra. 

Growth factor or serum dependence 

Transformed cells have a lower serum dependence than their normal 
counterparts (see, e.g. f Temin, J. Natl Cancer Insti. 37:167-175 (1966); Eagle et al, J. Exp. 
Med. 131:836-879 (1970)); Freshney, supra. This is in part due to release of various growth 
factors by the transformed cells. Growth factor or serum dependence of transformed host 
cells can be compared with that of control. 

Tumor specific markers levels 

Tumor cells release an increased amount of certain factors (hereinafter "tumor 
specific markers") than their normal counterparts. For example, plasminogen activator (PA) 
is released from human glioma at a higher level than from normal brain cells (see, e.g., 
Gullino, Angiogenesis t tumor vascularization, and potential interference with tumor growth, 
in Biological Responses in Cancer, pp. 178-184 (Mihich (ed.) 1985)). Similarly, Tumor 
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angiogenesis factor (TAF) is released at a higher level in tumor cells than their normal 
counterparts. See, e.g., Folkman, Angiogenesis and Cancer, Sem Cancer Biol (1992)). 

Various techniques which measure the release of these factors are described in 
Freshney (1994), supra. Also, see, Unkless et al , I Biol Chem. 249:4295-4305 (1974); 
Strickland & Beers, J. Biol Chem. 251:5694-5702 (1976); Whur et al, Br. J. Cancer 42:305- 
312 (1980); Gullino, Angiogenesis, tumor vascularization, and potential interference with 
tumor growth, in Biological Responses in Cancer, pp. 178-184 (Mihich (ed.) 1985); 
Freshney Anticancer Res. 5:111-130(1985). 

Invasiveness into Matrigel 

The degree of invasiveness into Matrigel or some other extracellular matrix 
constituent can be used as an assay to identify compounds that modulate metastatic colorectal 
cancer-associated sequences. Tumor cells exhibit a good correlation between malignancy 
and invasiveness of cells into Matrigel or some other extracellular matrix constituent. In this 
assay, tumorigenic cells are typically used as host cells. Expression of a tumor suppressor 
gene in these host cells would decrease invasiveness of the host cells. 

Techniques described in Freshney (1994), supra, can be used. Briefly, the 
level of invasion of host cells can be measured by using filters coated with Matrigel or some 
other extracellular matrix constituent. Penetration into the gel, or through to the distal side of 
the filter, is rated as invasiveness, and rated histologically by number of cells and distance 
moved, or by prelabeling the cells with 125 I and counting the radioactivity on the distal side of 
the filter or bottom of the dish. See, e.g, Freshney (1984), supra. 

Tumor growth in vivo 

Effects of metastatic colorectal cancer-associated sequences on cell growth 
can be tested in transgenic or immune-suppressed mice. Knock-out transgenic mice can be 
made, in which the metastatic colorectal cancer gene is disrupted or in which a metastatic 
colorectal cancer gene is inserted. Knock-out transgenic mice can be made by insertion of a 
marker gene or other heterologous gene into the endogenous metastatic colorectal cancer 
gene site in the mouse genome via homologous recombination. Such mice can also be made 
by substituting the endogenous metastatic colorectal cancer gene with a mutated version of 
the metastatic colorectal cancer gene, or by mutating the endogenous metastatic colorectal 
cancer gene, e.g., by exposure to carcinogens. 
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A DNA construct is introduced into the nuclei of embiyonic stem cells. Cells 
containing the newly engineered genetic lesion are injected into a host mouse embryo, which 
is re-implanted into a recipient female. Some of these embryos develop into chimeric mice 
that possess germ cells partially derived from the mutant cell line. Therefore, by breeding the 
chimeric mice it is possible to obtain a new line of mice containing the introduced genetic 
lesion (see, e.g., Capecchi et al, Science 244:1288 (1989)). Chimeric targeted mice can be 
derived according to Hogan et al , Manipulating the Mouse Embryo: A Laboratory Manual, 
Cold Spring Harbor Laboratory (1988) and Teratocarcinomas and Embryonic Stem Cells: A 
Practical Approach, Robertson, ed., IRL Press, Washington, D.C., (1987). 

Alternatively, various immune-suppressed or immune-deficient host animals 
can be used. For example, genetically athymic "nude" mouse (see, e.g, Giovaneila et al, J. 
Natl Cancer Inst. 52:921 (1974)), a SOD mouse, a thymectomized mouse, or an irradiated 
mouse (see, e.g., Bradley et al, Br. 1 Cancer 38:263 (1978); Selby et al, Br. J. Cancer 
41 :52 (1980)) can be used as a host. Transplantable tumor cells (typically about 10 6 cells) 
injected into isogenic hosts will produce invasive tumors in a high proportions of cases, while 
normal cells of similar origin will not In hosts which developed invasive tumors, cells 
expressing a metastatic colorectal cancer-associated sequences are injected subcutaneously. 
After a suitable length of time, preferably 4-8 weeks, tumor growth is measured (e.g., by 
volume or by its two largest dimensions) and compared to the control. Tumors that have 
statistically significant reduction (using, e.g., Student's T test) are said to have inhibited 
growth. Additionally, human tumor cells expressing the genes of the invention may be 
injected into immune compromised animals. Growth of these tumors, or xenografts, is 
compared to growth of similar human tumor cell that do not express the genes of the 
invention. These animals may also be used to binding assays and efficacy studies for 
therapeutic compounds that modulate metastatic colorectal cancer, such as antibodies or 
small molecules. 

Polynucleotide modulators of metastatic colorectal cancer 

Antisense Polynucleotides 

In certain embodiments, the activity of a metastatic colorectal cancer- 
associated protein is downregulated, or entirely inhibited, by the use of antisense 
polynucleotide, i.e., a nucleic acid complementary to, and which can preferably hybridize 
specifically to, a coding mRNA nucleic acid sequence, e.g., a metastatic colorectal cancer 
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protein mRNA, or a subsequence thereof. Binding of the antisense polynucleotide to the 
mRNA reduces the translation and/or stability of the mRNA. 

In the context of this invention, antisense polynucleotides can comprise 
naturaUy-occurring nucleotides, or synthetic species formed from naturally-occurring 
subunits or their close homologs. Antisense polynucleotides may also have altered sugar 
moieties or inter-sugar linkages. Exemplary among these are the phosphorothioate and other 
sulfur containing species which are known for use in the art. Analogs are comprehended by 
this invention so long as they function effectively to hybridize with the metastatic colorectal 
cancer protein mRNA. See, e.g., Isis Pharmaceuticals, Carlsbad, CA; Sequitor, Inc., Natick, 
MA. 

Such antisense polynucleotides can readily be synthesized using recombinant 
means, or can be synthesized in vitro. Equipment for such synthesis is sold by several 
vendors, including Applied Biosystems. The preparation of other oligonucleotides such as 
phosphorothioates and alkylated derivatives is also well known to those of skill in the art. 

Antisense molecules as used herein include antisense or sense 
oligonucleotides. Sense oligonucleotides can, e.g., be employed to block transcription by 
binding to the anti-sense strand. The antisense and sense oligonucleotide comprise a single- 
stranded nucleic acid sequence (either RNA or DNA) capable of binding to target mRNA 
(sense) or DNA (antisense) sequences for metastatic colorectal cancer molecules. A 
preferred antisense molecule is for a metastatic colorectal cancer sequence in Tables 1-26, or 
for a ligand or activator thereof. Antisense or sense oligonucleotides, according to the 
present invention, comprise a fragment generally at least about 14 nucleotides, preferably 
from about 14 to 30 nucleotides. The ability to derive an antisense or a sense 
oligonucleotide, based upon a cDNA sequence encoding a given protein is described in, e.g., 
Stein & Cohen (Cancer Res. 48:2659 (1988 and van der Krol et al (BioTechniques 6:958 
(1988)). 

Ribozymes 

In addition to antisense polynucleotides, ribozymes can be used to target and 
inhibit transcription of metastatic colorectal cancer-associated nucleotide sequences. A 
ribozyme is an RNA molecule that catalytically cleaves other RNA molecules. Different 
kinds of ribozymes have been described, including group I ribozymes, hammerhead 
ribozymes, hairpin ribozymes, RNase P, and axhead ribozymes (see, e.g>, Castanotto et al, 
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Adv. in Pharmacology 25: 289-317 (1994) for a general review of the properties of different 
ribozymes). 

The general features of hairpin ribozymes are described, e.g., in Hampel et al, 
Nucl Acids Res. 18:299-304 (1990); European Patent Publication No. 0 360 257; U.S. Patent 
No. 5,254,678. Methods of preparing are well known to those of skill in the art {see, e.g t 
WO 94/26877; Ojwang et al., Proc. Natl Acad. Sci. USA 90:6340-6344 (1993); Yamada et 
al, Human Gene Therapy 1:39-45 (1994); Leavitt et al, Proc. Natl Acad. Set USA 92:699- 
703 (1995); Leavitt et al, Human Gene Therapy 5:1 151-120 (1994); and Yamada et al, 
Virology 205: 121-126 (1994)), 

Polynucleotide modulators of metastatic colorectal cancer may be introduced 
into a cell containing the target nucleotide sequence by formation of a conjugate with a ligand 
binding molecule, as described in WO 91/04753. Suitable ligand binding molecules include, 
but are not limited to, cell surface receptors, growth factors, other cytokines, or other ligands 
that bind to cell surface receptors. Preferably, conjugation of the ligand binding molecule 
does not substantially interfere with the ability of the ligand binding molecule to bind to its 
corresponding molecule or receptor, or block entry of the sense or antisense oligonucleotide 
or its conjugated version into the cell. Alternatively, a polynucleotide modulator of 
metastatic colorectal cancer may be introduced into a cell containing the target nucleic acid 
sequence, e.g., by formation of an polynucleotide-lipid complex, as described in WO 
90/10448. It is understood that the use of antisense molecules or knock out and knock in 
models may also be used in screening assays as discussed above, in addition to methods of 
treatment. 

Thus, in one embodiment, methods of modulating metastatic colorectal cancer 
in cells or organisms are provided. In one embodiment, the methods comprise administering 
to a cell an anti-metastatic colorectal cancer antibody that reduces or eliminates die biological 
activity of an endogenous metastatic colorectal cancer protein. Alternatively, the methods 
comprise administering to a cell or organism a recombinant nucleic acid encoding a 
metastatic colorectal cancer protein. This may be accomplished in any number of ways. In a 
preferred embodiment, e.g., when the metastatic colorectal cancer sequence is down- 
regulated in metastatic colorectal cancer, such state may be reversed by increasing the 
amount of metastatic colorectal cancer gene product in the cell. This can be accomplished, 
e.g., by overexpressing the endogenous metastatic colorectal cancer gene or administering a 
gene encoding the metastatic colorectal cancer sequence, using known gene-therapy 
techniques. In a preferred embodiment, the gene therapy techniques include the 
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incorporation of the exogenous gene using enhanced homologous recombination (EHR), e.g., 
as described in PCT/US93/03868, hereby incorporated by reference in its entirety. 
Alternatively, e.g., when the metastatic colorectal cancer sequence is up-regulated in 
metastatic colorectal cancer, the activity of the endogenous metastatic colorectal cancer gene 
is decreased, e.g., by the administration of a metastatic colorectal cancer antisense nucleic 
acid. 

In one embodiment, the metastatic colorectal cancer proteins of the present 
invention may be used to generate polyclonal and monoclonal antibodies to metastatic 
colorectal cancer proteins. Similarly, the metastatic colorectal cancer proteins can be 
coupled, using standard technology, to affinity chromatography columns. These columns 
may then be used to purify metastatic colorectal cancer antibodies useful for production, 
diagnostic, or therapeutic purposes. In a preferred embodiment, the antibodies are generated 
to epitopes unique to a metastatic colorectal cancer protein; that is, the antibodies show little 
or no cross-reactivity to other proteins. The metastatic colorectal cancer antibodies may be 
coupled to standard affinity chromatography columns and used to purify metastatic colorectal 
cancer proteins. The antibodies may also be used as blocking polypeptides, as outlined 
above, since they will specifically bind to the metastatic colorectal cancer protein. 

Methods of identifying variant metastatic colorectal cancer-associated sequences 

Without being bound by theory, expression of various metastatic colorectal 
cancer sequences is correlated with metastatic colorectal cancer. Accordingly, disorders 
based on mutant or variant metastatic colorectal cancer genes may be determined. In one 
embodiment, the invention provides methods for identifying cells containing variant 
metastatic colorectal cancer genes, e.g., determining all or part of the sequence of at least one 
endogenous metastatic colorectal cancer genes in a cell. This may be accomplished using 
any number of sequencing techniques. In a preferred embodiment, fee invention provides 
methods of identifying the metastatic colorectal cancer genotype of an individual, e.g., 
determining all or part of the sequence of at least one metastatic colorectal cancer gene of the 
individual. This is generally done in at least one tissue of the individual, and may include the 
evaluation of a number of tissues or different samples of the same tissue. The method may 
include comparing the sequence of the sequenced metastatic colorectal cancer gene to a 
known metastatic colorectal cancer gene, i.e., a wild-type gene. 

The sequence of all or part of the metastatic colorectal cancer gene can then be 
compared to the sequence of a known metastatic colorectal cancer gene to determine if any 
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differences exist. This can be done using any number of known homology programs, such as 
Bestfit, etc. In a preferred embodiment, the presence of a difference in the sequence between 
the metastatic colorectal cancer gene of the patient and the known metastatic colorectal 
cancer gene correlates with a disease state or a propensity for a disease state, as outlined 
herein. 

In a preferred embodiment, the metastatic colorectal cancer genes are used as 
probes to determine the number of copies of the metastatic colorectal cancer gene in the 
genome. 

In another preferred embodiment, the metastatic colorectal cancer genes are 
used as probes to determine the chromosomal localization of the metastatic colorectal cancer 
genes. Information such as chromosomal localization finds use in providing a diagnosis or 
prognosis in particular when chromosomal abnormalities such as translocations, and the like 
are identified in the metastatic colorectal cancer gene locus. 

Administration of pharmaceutical and vaccine compositions 

In one embodiment, a therapeutically effective dose of a metastatic colorectal 
cancer protein or modulator thereof, is administered to a patient. By "therapeutically 
effective dose" herein is meant a dose that produces effects for which it is administered. The 
exact dose will depend on the purpose of the treatment, and will be ascertainable by one 
skilled in the art using known techniques (e.g., Ansel et al., Pharmaceutical Dosage Forms 
and Drug Delivery; Lieberman, Pharmaceutical Dosage Forms (vols. 1-3, 1992), Dekker, 
ISBN 0824770846, 082476918X, 0824712692, 0824716981; Lloyd, The Art, Science and 
Technology of Pharmaceutical Compounding (1999); and Pickar, Dosage Calculations 
(1999)). As is known in the art, adjustments for metastatic colorectal cancer degradation, 
systemic versus localized delivery, and rate of new protease synthesis, as well as the age, 
body weight, general health, sex, diet, time of administration, drug interaction and the 
severity of the condition may be necessary, and will be ascertainable with routine 
experimentation by those skilled in the art 

A "patient" for the purposes of the present invention includes both humans 
and other animals, particularly mammals. Thus the methods are applicable to both human 
therapy and veterinary applications. In the preferred embodiment the patient is a mammal, 
preferably a primate, and in the most preferred embodiment the patient is human. 

The administration of the metastatic colorectal cancer proteins and modulators 
thereof of the present invention can be done in a variety of ways as discussed above, 
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including, but not limited to, orally, subcutaneously, intravenously, intranasally, 
transdermally, intraperitoneally, intramuscularly, intrapulmonary, vaginally, rectally, or 
intraocularly. In some instances, e.g., in the treatment of wounds and inflammation, the 
metastatic colorectal cancer proteins and modulators may be directly applied as a solution or 
spray. 

The pharmaceutical compositions of the present invention comprise a 
metastatic colorectal cancer protein in a form suitable for administration to a patient. In the 
preferred embodiment, the pharmaceutical compositions are in a water soluble form, such as 
being present as pharmaceutically acceptable salts, which is meant to include both acid and 
base addition salts. "Pharmaceutically acceptable acid addition salt" refers to those salts that 
retain the biological effectiveness of the free bases and that are not biologically or otherwise 
undesirable, formed with inorganic acids such as hydrochloric acid, hydrobromic acid, 
sulfuric acid, nitric acid, phosphoric acid and the like, and organic acids such as acetic acid, 
propionic acid, glycolic acid, pyruvic acid, oxalic acid, maleic acid, malonic acid, succinic 
acid, fumaric acid, tartaric acid, citric acid, benzoic acid, cinnamic acid, mandelic acid, 
methanesulfonic acid, ethanesulfonic acid, p-toluenesulfonic acid, salicylic acid and the like. 
"Pharmaceutically acceptable base addition salts" include those derived from inorganic bases 
such as sodium, potassium, lithium, ammonium, calcium, magnesium, iron, zinc, copper, 
manganese, aluminum salts and the like. Particularly preferred are the ammonium, 
potassium, sodium, calcium, and magnesium salts. Salts derived from pharmaceutically 
acceptable organic non-toxic bases include salts of primary, secondary, and tertiary amines, 
substituted amines including naturally occurring substituted amines, cyclic amines and basic 
ion exchange resins, such as isopropylamine, trimethylamine, diethylamine, triethylamine, 
tripropylamine, and ethanolamine. 

The pharmaceutical compositions may also include one or more of the 
following: carrier proteins such as serum albumin; buffers; fillers such as microcrystalline 
cellulose, lactose, com and other starches; binding agents; sweeteners and other flavoring 
agents; coloring agents; and polyethylene glycol. 

The pharmaceutical compositions can be administered in a variety of unit 
dosage forms depending upon the method of administration. For example, unit dosage forms 
suitable for oral administration include, but are not limited to, powder, tablets, pills, capsules 
and lozenges. It is recognized that metastatic colorectal cancer protein modulators (e.g., 
antibodies, antisense constructs, ribozymes, small organic molecules, etc.) when administered 
orally, should be protected from digestion. It is also recognized that, after delivery to other 
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sites in the body (e.g., circulatory system, lymphatic system, or the tumor site) the metastatic 
colorectal cancer modulators of the invention may need to be protected from excretion, 
hydrolisis, proteolytic digestion or modification, or detoxification by the liver. In all these 
cases, protection is typically accomplished either by complexing the molecule(s) with a 
composition to render it resistant to acidic and enzymatic hydrolysis, or by packaging the 
molecule(s) in an appropriately resistant carrier, such as a liposome or a protection barrier or 
by modifying the molecular size, weight, and/or charge of the modulator. Means of 
protecting agents from digestion degradation, and excretion are well known in the art. 

The compositions for administration will commonly comprise a metastatic 
colorectal cancer protein modulator dissolved in a pharmaceutically acceptable carrier, 
preferably an aqueous carrier. A variety of aqueous carriers can be used, e.g., buffered saline 
and the like. These solutions are sterile and generally free of undesirable matter. These 
compositions may be sterilized by conventional, well known sterilization techniques. The 
compositions may contain pharmaceutically acceptable auxiliary substances as required to 
approximate physiological conditions such as pH adjusting and buffering agents, toxicity 
adjusting agents and the like, e.g., sodium acetate, sodium chloride, potassium chloride, 
calcium chloride, sodium lactate and the like. The concentration of active agent in these 
formulations can vary widely, and will be selected primarily based on fluid volumes, 
viscosities, body weight and the like in accordance with the particular mode of administration 
selected and the patient's needs (e.g., Remington 's Pharmaceutical Science (15th ed, 1980) 
and Goodman & Gillman, The Pharmacologial Basis of Therapeutics (Hardman et al. 9 eds., 
1996)). 

Thus, a typical pharmaceutical composition for intravenous administration 
would be about 0.1 to lOmg per patient per day. Dosages from 0.1 up to about lOOmgper 
patient per day may be used, particularly when the drug is administered to a secluded site and 
not into the blood stream, such as into a body cavity or into a lumen of an organ. 
Substantially higher dosages are possible in topical administration. Actual methods for 
preparing parenteraUy administrable compositions will be known or apparent to those skilled 
in the art, e.g., Remington 's Pharmaceutical Science and Goodman and Gillman, The 
Pharmacologial Basis of Therapeutics, supra. 

The compositions containing modulators of metastatic colorectal cancer 
proteins can be administered for therapeutic or prophylactic treatments. In therapeutic 
applications, compositions are administered to a patient suffering from a disease (e.g., a 
cancer) in an amount sufficient to cure or at least partially arrest the disease and its 
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complications. An amount adequate to accomplish this is defined as a 'therapeutically 
effective dose." Amounts effective for this use will depend upon the severity of the disease 
and the general state of the patient's health. Single or multiple administrations of the 
compositions may be administered depending on the dosage and frequency as required and 
tolerated by the patient. In any event, the composition should provide a sufficient quantity of 
the agents of this invention to effectively treat the patient. An amount of modulator that is 
capable of preventing or slowing the development of cancer in a mammal is referred to as a 
"prophylactically effective dose." The particular dose required for a prophylactic treatment 
will depend upon the medical condition and history of the mammal, the particular cancer 
being prevented, as well as other factors such as age, weight, gender, administration route, 
efficiency, etc. Such prophylactic treatments may be used, e.g., in a mammal who has 
previously had cancer to prevent a recurrence of the cancer, or in a mammal who is suspected 
of having a significant likelihood of developing cancer. 

It will be appreciated that the present metastatic colorectal cancer protein- 
modulating compounds can be administered alone or in combination with additional 
metastatic colorectal cancer modulating compounds or with other therapeutic agent, e.g., 
other anti-cancer agents or treatments. 

In numerous embodiments, one or more nucleic acids, e.g., polynucleotides 
comprising nucleic acid sequences set forth in Tables 1-26, such as antisense polynucleotides 
or ribozymes, will be introduced into cells, in vitro or in vivo. The present invention provides 
methods, reagents, vectors, and cells useful for expression of metastatic colorectal cancer- 
associated polypeptides and nucleic acids using in vitro (cell-free), ex vivo or in vivo (cell or 
organism-based) recombinant expression systems. 

The particular procedure used to introduce the nucleic acids into a host cell for 
expression of a protein or nucleic acid is application specific. Many procedures for 
introducing foreign nucleotide sequences into host cells may be used. These include the use 
of calcium phosphate transfection, spheroplasts, electroporation, liposomes, microinjection, 
plasma vectors, viral vectors and any of the other well known methods for introducing cloned 
genomic DNA, cDNA, synthetic DNA or other foreign genetic material into a host cell (see, 
e.g., Berger & Kimmel, Guide to Molecular Cloning Techniques, Methods in Enzymology 
volume 152 (Berger), Ausubel et aL, eds., Current Protocols (supplemented through 1999), 
and Sambrook et aL, Molecular Cloning - A Laboratory Manual (2nd ed., Vol. 1-3, 1989. 

In a preferred embodiment, metastatic colorectal cancer proteins and 
modulators are administered as therapeutic agents, and can be formulated as outlined above. 
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Similarly, metastatic colorectal cancer genes (including both the foil-length sequence, partial 
sequences, or regulatory sequences of the metastatic colorectal cancer coding regions) can be 
administered in a gene therapy application. These metastatic colorectal cancer genes can 
include antisense applications, either as gene therapy (i.e., for incoiporation into the genome) 
or as antisense compositions, as will be appreciated by those in the art. 

Metastatic colorectal cancer polypeptides and polynucleotides can also be 
administered as vaccine compositions to stimulate HTL, CTL and antibody responses.. Such 
vaccine compositions can include, e.g., lipidated peptides (see, e.g.,Vitiello, et al, J. Clin, 
Invest 95:341 (1995)), peptide compositions encapsulated in poly(DL-lactide-co-glycolide) 
("PLG") microspheres (see, e.g., Eldridge, et al, Molec. Immunol. 28:287-294, (1991); 
Alonso etal, Vaccine 12:299-306 (1994); Jones et al, Vaccine 13:675-681 (1995)), peptide 
compositions contained in immune stimulating complexes (ISCOMS) (see, e.g., Takahashi et 
al, Nature 344:873-875 (1990); Hue/ al, Clin Exp Immunol 113:235-243 (1998)), multiple 
antigen peptide systems (MAPs) (see, e.g., Tarn, Proc. Natl Acad. Sci. U.S.A. 85:5409-5413 
(1988); Tarn, /. Immunol Methods 196:17-32 (1996)), peptides formulated as multivalent 
peptides; peptides for use in ballistic delivery systems, typically crystallized peptides, viral 
delivery vectors (Perkus, et al, In: Concepts in vaccine development (Kaufinann, ed., p. 379, 
1996); Chakrabarti, et al, Nature 320:535 (1986); Hu et al, Nature 320:537 (1986); Kieny, 
et al, AIDS Bio/Technology 4:790 (1986); Top et al, J. Infect Dis. 124:148 (1971); Chanda 
et al, Virology 175:535 (1990)), particles of viral or synthetic origin (see, e.g., Kofler et al, 
J. Immunol. Methods. 192:25 (1996); Eldridge et al, Sem. Hematol. 30:16 (1993); Falo et al, 
Nature Med. 7:649 (1995)), adjuvants (Warren et al, Annu. Rev. Immunol 4:369 (1986); 
Gupta a/., Vaccine 11:293 (1993)), liposomes (Reddy et al, J. Immunol 148:1585(1992); 
Rock, Immunol. Today 17:131 (1996)), or, naked or particle absorbed cDNA (Ulmer, et al, 
Science 259:1745 (1993); Robinson et al, Vaccine 11:957 (1993); Shiver et al, In: Concepts 
in vaccine development (Kaufinann, ed., p. 423, 1996); Cease & Berzofsky, Annu. Rev. 
Immunol 12:923 (1994) and Eldridge et al, Sem. Hematol. 30:16 (1993)). Toxin-targeted 
delivery technologies, also known as receptor mediated targeting, such as those of Avant 
Immunotherapeutics, Inc. (Needham, Massachusetts) may also be used. 

Vaccine compositions often include adjuvants. Many adjuvants contain a 
substance designed to protect the antigen from rapid catabolism, such as aluminum hydroxide 
or mineral oil, and a stimulator of immune responses, such as lipid A, Bortadella pertussis or 
Mycobacterium tuberculosis derived proteins. Certain adjuvants are commercially available 
as, e.g., Freund's Incomplete Adjuvant and Complete Adjuvant (Difco Laboratories, Detroit, 
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MI); Merck Adjuvant 65 (Merck and Company, Inc., Rahway, NJ); AS-2 (SmithKline 
Beecham, Philadelphia, PA); aluminum salts such as aluminum hydroxide gel (alum) or 
aluminum phosphate; salts of calcium, iron or zinc; an insoluble suspension of acylated 
tyrosine; acylated sugars; cationicaliy or anionically derivatized polysaccharides; 
polyphosphazenes; biodegradable microspheres; monophosphoryl lipid A and quil A. 
Cytokines, such as GM-CSF, interleukin-2, -7, -12, and other like growth factors, may also be 
used as adjuvants. 

Vaccines can be administered as nucleic acid compositions wherein DNA or 
RNA encoding one or more of the polypeptides, or a fragment thereof; is administered to a 
patient This approach is described, for instance, in Wolff et. ai 9 Science 247:1465 (1990) as 
well as U.S. Patent Nos. 5,580,859; 5,589,466; 5,804,566; 5,739,1 18; 5,736,524; 5,679,647; 
WO 98/04720; and in more detail below. Examples of DNA-based delivery technologies 
include "naked DNA", facilitated (bupivicaine, polymers, peptide-mediated) delivery, 
cationic lipid complexes, and particle-mediated ("gene gun") or pressure-mediated delivery 
{see, e.g. t U.S. Patent No. 5,922,687). 

For therapeutic or prophylactic immunization purposes, the peptides of the 
invention can be expressed by viral or bacterial vectors. Examples of expression vectors 
include attenuated viral hosts, such as vaccinia or fowlpox. This approach involves the use of 
vaccinia virus, e.g., as a vector to express nucleotide sequences that encode metastatic 
colorectal cancer polypeptides or polypeptide fragments. Upon introduction into a host, the 
recombinant vaccinia virus expresses the immunogenic peptide, and thereby elicits an 
immune response. Vaccinia vectors and methods useful in immunization protocols are 
described in, e.g., U.S. Patent No. 4,722,848. Another vector is BCG (Bacille Calmette 
Guerin). BCG vectors are described in Stover et al. 9 Nature 351 :456-460 (1 991). A wide 
variety of other vectors useful for therapeutic administration or immunization e.g., adeno and 
adeno-associated virus vectors, retroviral vectors, Salmonella typhi vectors, detoxified 
anthrax toxin vectors, and the like, will be apparent to those skilled in the art from the 
description herein (see, e.g. t Shata et al. 9 Mol Med Today 6:66-71 (2000); Shedlock etal.,J 
Leukoc Biol 68:793-806 (2000); Hipp et aL 9 In Vivo 14:571-85 (2000)). 

Methods for the use of genes as DNA vaccines are well known, and include 
placing a metastatic colorectal cancer gene or portion of a metastatic colorectal cancer gene 
under the control of a regulatable promoter or a tissue-specific promoter for expression in a 
metastatic colorectal cancer patient. The metastatic colorectal cancer gene used for DNA 
vaccines can encode full-length metastatic colorectal cancer proteins, but more preferably 
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encodes portions of the metastatic colorectal cancer proteins including peptides derived from 
the metastatic colorectal cancer protein. In one embodiment, a patient is immunized with a 
DNA vaccine comprising a plurality of nucleotide sequences derived from a metastatic 
colorectal cancer gene. For example, metastatic colorectal cancer-associated genes or 
sequence encoding subfragments of a metastatic colorectal cancer protein are introduced into 
expression vectors and tested for their immunogenicity in the context of Class I MHC and an 
ability to generate cytotoxic T cell responses. This procedure provides for production of 
cytotoxic T cell responses against cells which present antigen, including intracellular 
epitopes. 

In a preferred embodiment, the DNA vaccines include a gene encoding an 
adjuvant molecule with the DNA vaccine. Such adjuvant molecules include cytokines that 
increase the immunogenic response to the metastatic colorectal cancer polypeptide encoded 
by the DNA vaccine. Additional or alternative adjuvants are available. 

In another preferred embodiment metastatic colorectal cancer genes find use in 
generating animal models of metastatic colorectal cancer. When the metastatic colorectal 
cancer gene identified is repressed or diminished in metastatic tissue, gene therapy 
technology, e.g., wherein antisense RNA directed to the metastatic colorectal cancer gene 
will also diminish or repress expression of the gene. Animal models of metastatic colorectal 
cancer find use in screening for modulators of a metastatic colorectal cancer-associated 
sequence or modulators of metastatic colorectal cancer. Similarly, transgenic animal 
technology including gene knockout technology, e.g., as a result of homologous 
recombination with an appropriate gene targeting vector, will result in the absence or 
increased expression of the metastatic colorectal cancer protein. When desired, tissue- 
specific expression or knockout of the metastatic colorectal cancer protein may be necessary. 

It is also possible that the metastatic colorectal cancer protein is overexpressed 
in metastatic colorectal cancer. As such, transgenic animals can be generated that 
overexpress the metastatic colorectal cancer protein. Depending on the desired expression 
level, promoters of various strengths can be employed to express the transgene. Also, the 
number of copies of the integrated transgene can be determined and compared for a 
determination of the expression level of the transgene. Animals generated by such methods 
find use as animal models of metastatic colorectal cancer and are additionally useful in 
screening for modulators to treat metastatic colorectal cancer. 
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Kits for Use in Diagnostic and/or Prognostic Applications 

For use in diagnostic, research, and therapeutic applications suggested above, 
kits are also provided by the invention. In the diagnostic and research applications such kits 
may include any or all of the following: assay reagents, buffers, metastatic colorectal cancer- 
specific nucleic acids or antibodies, hybridization probes and/or primers, antisense 
polynucleotides, ribozymes, dominant negative metastatic colorectal cancer polypeptides or 
polynucleotides, small molecules inhibitors of metastatic colorectal cancer-associated 
sequences etc. A therapeutic product may include sterile saline or another phannaceutically 
acceptable emulsion and suspension base. 

In addition, the kits may include instructional materials containing directions 
(i.e., protocols) for the practice of the methods of this invention. While the instructional 
materials typically comprise written or printed materials they are not limited to such. Any 
medium capable of storing such instructions and communicating them to an end user is 
contemplated by this invention. Such media include, but are not limited to electronic storage 
media (e.g., magnetic discs, tapes, cartridges, chips), optical media (e.g., CD ROM), and the 
like. Such media may include addresses to internet sites that provide such instructional 
materials. 

The present invention also provides for kits for screening for modulators of 
metastatic colorectal cancer-associated sequences. Such kits can be prepared from readily 
available materials and reagents. For example, such kits can comprise one or more of the 
following materials: a metastatic colorectal cancer-associated polypeptide or polynucleotide, 
reaction tubes, and instructions for testing metastatic colorectal cancer-associated activity. 
Optionally, the kit contains biologically active metastatic colorectal cancer protein. A wide 
variety of kits and components can be prepared according to the present invention, depending 
upon the intended user of the kit and the particular needs of the user. Diagnosis would 
typically involve evaluation of a plurality of genes or products. The genes will be selected 
based on correlations with important parameters in disease which may be identified in 
historical or outcome data. 
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Table 1 



Pkey: Unique Bos prcbesct identifier number 

ExAcau Exemplar Accession cumber, Genbank secession number 

UnigenelD: Unigeno number 

Unigene Title: Unigene gene title 



Pkey ExAccn UnigenelD Unigene Title 



Ratio BS_Mets Top 3 expressing cell lines 



103989 
101169 
101880 
129462 
131676 
131861 
118823 
101107 



102306 
126419 
101198 
107652 
128145 
110660 
111669 
124867 
127352 
130736 
128592 
108092 
133373 
100572 
115775 
120811 
111919 
117009 
101124 
108151 
134733 
131739 
116311 
134174 
106753 
104842 
129161 
105675 
100547 
116857 
113222 
118768 
114542 
101640 
107754 
104668 
135377 
127083 
102329 
117882 
126405 
131378 
111418 



AA314779 

L15533 

M97925 

D84239 

C20785 

D11925 

N79237 

L08010 

Y00339 

U33317 

AA451775 

L21998 

AA010195 



H82117 
R19305 



AA416577 

T99385 

AA470056 

AA0459S1 

S72487 

HG2271 

AA424030 

AA346854 

R39926 

H85422 

L10343 

AA424958 

U03644 

AA449749 



108710 

105437 
103448 
130436 
112309 
103211 
109012 



113466 
103029 



U05259 

AA476944 

AA039854 

N27334 

AA284767 

HG2149 

H65841 

T59670 

N74467 

AA055768 

M58459 

AA017462 

AA007312 

C21382 

Z44079 

U35407 

N50101 

U46278 

AA463886 

R01084 

M194075 

AA121960 

AA252191 

X99133 

M84526 

R55021 

X73079 

AA156576 

AF005887 

TB6945 

X54489 



Hs.105484 

Hs.423 

Hs.72887 

Hs.111732 

Hs30514 

Hs.184245 

Hs£0813 

Hs.4158 

Hs.155097 

Hs.711 

Hs.129064 

Hs315 

HS52642 

Hs.166669 

Hs.28043 

Hs.1 10347 

Hs.168500 

Hs.189105 

Hs.18646 

Hs.113994 

Hs.169355 

Hs.73946 

Hs.46627 

Hs.52788 

Hs£1031 

Hs.108556 

Hs.1 12341 

Hs.33735 

Hs.89421 

Hs.31386 

Hs.48752 

Hs.79630 

Hs.7331 

Hs.8065 

Hs.181780 

H&252808 

Hs,186550 

Hs.10615 

Hs.94304 

Hs.122576 

Hs.1 80911 

Hs.187571 

Hs.183852 

Hs.99766 

Hs.91608 

Hs.158084 

Hs.124724 

Hs.122489 

Hs.203910 

Hs.19081 

Ue QQQflft 



ESTs; WeaWysimflartoUTHOSTATHINE 1 



defensh; alpha 5; Panetii cetkpecific 

IgG Fc binding protein 

ESTs 

K1AAQ929 protein Max2 interacting rujctea 

ESTs; Weakly similar to long chain fatty 

regenerating islet-derived 1 beta (pancr 

carbonic anhydrase II 

defensin; alpha 6; Paneth celkpecific 

H sapiens chromosome 19; cosmid F22162 

mucin % intestmal/tracheal 

ESTs; Weakly similar to llll ALU CLASS F 

ESTs; Weakly similar to sodium btcarbona 

ESTs 

H sapiens mRNA for alpha integrin bindin 

ESTs 

ESTs 

EST 

ESTs; Weakly similar to alternatively sp 



15.77 

1158 

9.24 

8.57 1 

7.43 

7.15 

6.72 

6.33 

6.18 

5.67 

5.14 

5.1 

4.94 

4.77 

4.54 

4.52 

4.5 

4.41 

4.29 

4.18 



ESTs; Weakly similar to TRANSCRIPTION RE 

endotheBal ceil growth factor 1 (piatel 4.03 

ProfDaggrin 4.03 

ESTs 4.02 

fragile X mental retardation; autosomal 4.01 

ESTs 358 

ESTs 3.97 

protease inhfoltor 3; skiiwterived (SKAL 3.89 

ESTs 3.88 

CBF1 interacting compressor 3.88 

ESTs; Highly similar to secreted apoptos 3.87 

ESTs 3.84 

CD79A antigen (imniurujgtorxilirhassociated 3.83 

ESTs 3.82 

H sapiens mRNA full length insert cDNA c 3.78 

ESTs 3.75 

ESTs; Highly similar to pulmonary suriac 3.75 

Mucin (Gb:M57417) 3.75 

ESTs 3.73 

ESTs 3.7 

EST 3.68 

ESTs 3.66 

rtbosomal protein S4; Y-linked 3.62 

ESTs 3.6 

ESTs; Weakly similar to polymerase [H.sa 3.58 



Hs.25199 

Hs.204238 

Hs.155597 



H sapiens mRNA; cDNA DKF2p564J0323 (from 

otoferDn 353 

peroxisome receptor 1 351 

ESTs; Weakly similar to coded far by C. 3.47 

ESTs 3.46 

smali glutamirie-rich tetratricopeptide r 3.45 

ESTs 3.43 

nuclear receptor coactivator 4 14 
zm24g9.s1 Stratagene pancreas (#93728) H 

mRNAseq 3.4 

ESTs; Highly similar to match to ESTs AA 338 

Dpocafin 2 (oncogene 24p3) 338 

D component of complement (adipsin) 337 

y{76d5.s1 Soares breast 2NbHBstHsapIen 336 

polymeric immunoglobulin receptor 335 

ESTs 3.21 

activating transcription factor 6 3.19 

ESTs 3.18 



Hs.205126 
Hs.191466 
Hs.247433 
Hs.16304 

Hs.789 GR01 oncogene (melanoma growth stimulati 3.16 



EB.celis, rfra.cells, HMEC 
HMEC (total RNA), Fibroblasts 2, I 
Fibroblasts 2. MB231.ceDs, MBWA-453 
EB.celis, OVCAfLcelis, HS578T.cells 
HMEC (total RNA), HMEC, Fibroblasts 2 
HMEC, HMEC (total RNA), Fibroblasts 2 
HMEC, HMEC (total RNA), Lu_AD_H23 
BT474_cells. Fibroblasts 2, MB231_ce!ls 
OVCARjJells, MCF7, 293T_ceDs 
Fibroblasts 2, HMEC, HT29 eels 
HS578T_cells, HMEC (total RNA), HMEC 
EB.celis, HT29_ceiis, MB231_cefe 
HMEC (total RNA). HMEC, EB.celis 
HS578T_cefls, HMEC, Lu_SQ_H520 
HMEC, HS578T_ceGs, BT474_cefls 
HMEC, HS578T_cefls ) Caca2 
HMEC, HMEC (total RNA), HS578T_ceiis 
HMEC, HMEC (total RNA), MB4WDA-435S 
HMEC, EB.celis, HMEC (total RNA) 
HMEC (total RNA), HMEC, Rbioblasfs2 
4.04 HMEC (total RNA), HMEC, Fibroblasts 2 
EB.ceBs, HMEC, HMEC (total RNA) 
HMEC (total RNA), HMEC, Fibroblasts 2 . 
HMEC, HMEC (total RNA), EB.ceBs 
HMEC (total RNA), HMEC, Fibroblasts 2 
EB.celis, HMEC (total RNA), HMEC 
HMEC (total RNA), HMEC, Fibroblasts 2 
PC3_ceJts, RPWEJ, Caco2 
EB.ceBs, HMEC, HMEC (total RNA) 
EB.ceDs, HMEC, HMEC (total RNA) 
HS576T.cefls, MB-MDA435S, HT29_ceGs 
HS578T_ceils, HMEC, LNCaP.cells 
DU145.ce0s, LilAD_H23, MB231_ce!ls 
LNCaP.cete, Lu.SC.H345, DLi145„ceQs 
HS578T.cells, A549 cells, CALU&.cefe 
HMEC (total RNA), HMEC, BT474_cefls 
293T_ceIls, PRSC.con, HT29_ceil3 
HMEC (total RNA), HMEC, Fibroblasts 2 
HS578T.cells, 293T_celIs, HMEC 
HMEC, HS578T_celb, Caco2 
HrvEC, HS57BT_ceBs, OVCAR.ceDs 
EB.ceDs, MCF7, LNCaP.cells 
DU145.ceIls, RPWE^, A549_ceBs 
HMEC (total RNA), Fibroblasts 2, Fftroblasts2 
HMEC (total RNA), HMEC, Fibroblasts 2 
3.56 HMEC, HMEC (total RNA), EB.celis 
HMEC (total RNA), HMEC, Fibroblasts 2 
HMEC, HMEC (total RNA), EB.ceQs 
HMEC (total RNA), HMEC, EB.celis 
LNCaP.cells, MCF7, DU145_ceHs 
EB.celis, HMEC, HMEC (total RNA) 
HS578T_ceils, EB.celis, LuJU)_H23 
HS578T_cells, EB.celb, HMEC 

EB.celis, HMEC, HMEC (total RNA) 
EB.celis, LNCaP.cells, RPWEJ 
PC3.cells, EB.celis, HT29.celis 
PRSC.con, EB.celis, Lu_AD_H23 
EB.celis, HMEC, HMEC (total RNA) 
MB231_ceils, HT29.oelIs, Lu_SC_H69 
EB.ceDs, HMEC, HMEC (total RNA) 
HMEC (total RNA), HMEC, Lu_AD_H23 
HMEC, MB231.ceIls r Caco2 
LuJLC.H460, PC3.oells, FB>robiasts2 
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109374 AA218727 Hs,210785 ESTs; Highly similar to Ibd1 [Rsapiens] 3.13 

131403 R55750 Hs.26455 ESTs 3.13 

113420 T83964 Hs.15400 ESTs 3.11 

112532 R69824 Hs28313 ESTs 3.11 

117905 N50782 Hs.231713 EST 3.11 

125349 T87826 Hs.1644fl0 ESTs 3.1 

107072 AA609113 Hs.177533 H sapiens mRNA; cDNA DKFZp586N031B (from 

118389 N64583 

117653 N38970 

101082 L05072 

126105 H75323 

120006 W90108 

127191 AA297581 



Hs.182385 ESTs 
Hs.194214 ESTs 

Hs.80645 Interferon regulatory factor 1 
Hs.167614 ESTs 

Hs, 10848 K1AAQ187 gene product 

EST1 13160 Gall bladder ! H sapiens cONA 
106899 AM90107 Hs^1753 JM5 protein 
112784 R96306 Hs.191290 ESTs 
113613 T93337 Hs.17167 ESTs; Highly similar to LRR FLU tatera 
107631 AA007230 Hs.95026 ESTs 

101923 S75256 HNL=neutrophl! fipocalin [human, ovarian 

100695 H6315T Beta-1-GtycoprotBfn 11, Pregnancy-Specif 

102523 U53445 Hs. 15432 deregulated in ovarian cancer 1 
121588 AA416615 Hs.98242 ESTs 
103714 AA047055 Hs.192943 ESTs 
104916 AA056588 Hs.16542 ESTs 
ESTs 
ESTs, 

coagubtion factor X 
carbonic anhydrase IV 
Hs.109108 solute carrier family 22 (organic cation 
Hs.27038 ESTs; Moderately simBar to X-Onked ret 
ESTs 
ESTs 



109928 H05961 
104586 R78309 
101236 L29433 
134749 L10955 
124703 R07294 
114108 Z38431 
107857 AA024687 Hs.61208 
111586 R10759 Hs.15177 



Hs^6331 
Hs.20787 
Hs.47913 
Hs.89485 



127553 AA282433 



H sapiens p60 kafcanin mRNA; complete cds 



3.05 
3.04 
3.04 
3.03 
3.03 
3.02 
3.02 
3.02 
3.02 
3.02 
3.01 
3.01 
2.98 
2.94 
2.94 
2.93 
2.92 
2.92 
2.91 
2.9 
2.9 
2.89 
2.88 
2.88 
2.87 



129881 AA458952 Hs.197728 ESTs; Weakly similar to ZINC FINGER PROT 2.86 

116852 H65459 Hs.38323 ESTs 2.85 

133468 X03068 Hs.73931 major histocompatibility complex; dass 282 

130998 C00810 Hs.21970 guanine nucleotide binding protein (G pr 2.82 

124075 H05741 Hs.101643 ESTs 2.82 

128108 AI247422 Hs.129966 ESTs 2.82 

128096 R15413 Hs.164919 ESTs; Highly similar to PROTEIN KINASE C 2.8 
126619 Z2B861 HSBA7E032 STRATAGENE Human skeletal muse 

cONA ctone A7EQ3, mRNA seq. 2.77 

114418 AA011383 Hs.177313 ESTs 2.77 

120383 AA228030 Hs. 120234 ESTs 2.77 

126535 H73017 Hs.250723 ESTs; Weakly similar to atrophin-1 relat 2.76 

yc1QdQ8.s1 Stratagene lung (#937210) H s 2.76 

Hs.141438 ESTs; Moderately similartD similar to C 2.76 



119347 T64349 
126219 N36368 
125426 R43963 
103005 X52008 



Hs.169355 ESTs; WeaJdy similar to TRANSCRIPTION RE 



Hs.2700 glycine receptor; alpha 2 2.74 

109170 AA180352 Hs.191472 ESTs 2.74 

101125 L10373 Hs.82749 transmembrane 4 superfamDy member 2 2.73 

130656 Z20481 Hs.17411 WAA0699 protein 273 

122933 AA476728 Hs.107537 ESTs 2.72 

126033 AA055978 Hs.3807 ESTs; Weakly slmflar to PHOSPHOLEMMAN PR 

111644 R16539 Hs.223649 EST; Moderately similar to Cd-7 MstaDo 2.71 

133719 AA033790 Hs.75736 apoBpoprotein D 2.71 

127555 AA582324 Hs.192857 ESTs 2,7 

113321 T70580 Hs.13759 ESTs 2.69 

109326 AA210719 Hs.86414 ESTs 2.68 

135003 H42527 Hs.92832 ESTs 2.68 

103650 Z70220 H.sapiens mRNA for 5UTR for unknown pro 2.68 

111507 R07728 Hs.191218 ESTs 2.67 

117084 H93081 Hs.41829 ESTs 2.67 

103975 AA306264 Hs.176403 ESTs; Moderately similar to 111! ALU SUB 2.67 

132850 R89741 Hs.58215 ESTs; Moderately similar to rtiotekin [M. 2.66 

121599 AA416770 Hs.98255 EST 2.61 

124230 H63111 Hs.6655 ESTs 2.6 

114174 Z39055 Hs.27264 EST s; Moderately similar to I II! ALU SUB 2.58 

128469 T23724 Hs.258677 EST 2.57 

117399 N264B0 Hs.43805 BpomaHMGIC fusion partner-like 3 2.57 

129279 AA460551 Hs.184860 ESTs; Weakly similar to EG:87B1.6 [D.mel 2.57 

119817 W74257 Hs.159690 ESTs 2.57 

114445 AA019594 Hs.250493 ESTs; Weakly similar to KIAA0390 [H.sapl 2.56 

120651 AA287286 Hs.99657 ESTs 2.55 

105707 AA291012 Hs.37617 ESTs; Weakly similar to K1AA0727 protein 255 

128483 T58588 Hs5148 FLN29 gene product Z54 

125890 AA448739 Hs.116708 ESTs; Weakly similar to HYPOTHETICAL PRO 



Caco2, A549_celis, MB231_ceHs 

HS578T.ce!ls, HWEC, MB231.cells 

HMEC (total RNA), HMEC, EB_cells 

HMEC, HMEC (total RNA), EB.cells 

HMEC, HS578T_cells, Caco2 

HS578T.ceHs, EB.cells, MB4UDA435S 

3.1 U.SC.H69, MB-MDA-453, MB231.cells 

HMEC, HMEC, WCaPjsDs 

HMEC, HMEC (total RNA), Fttiroblasts2 

EB.cells, PRSC.con, DU145„ceDs 

HS578T_cells, HMEC (total RNA), HMEC 

HMEC, HMEC (total RNA), EB.cells 

HMEC, LuJU).H23, Lu_SQJi520 

EBLcelb. HMEC (total RNA); HMEC 

EB.cells, HMEC, LuJ\D.358 

HMEC (total RNA), EB.cells, HMEC 

Lu.SC.H345, HS578T_ceHs, UJ.CJW60 

PC3_ce0s, EB.cells, HT29 cells 

Fforobiasts2, Lu_AD_H23, ~MB-MDA-435s 

PRSC.con, Fibroblasts 2, HMEC 

HMEC, HS578T_ceDs, BT474_ceDs 

HS578T.csBs, EB.cells, HMEC 

HMEC (total RNA), Fibroblasts 2, HMEC 

HMEC, MB231_ceils. HS578T_cells 

Caco2, UlAD.358, Lu_ADJ58 

HMEC, HS578T_ce«s, Caco2 

BT474.cefls, MCF7, HMEC (total RNA) 

HMEC, HMEC (totaJ RNA), »MDA435s 

HMEC, HMEC (total RNA), EB.cells 

HS578T.ceHs, MB231_ceBs, HMEC 

HS578T_cells, Lu.LC.H460, PRSC.Con 

EB.cells, MB-MDA-435S, RPWE.2 

EB.cells, PC3.cells, HMEC 

HMEC. Caco2, HS578T_cefe 

MB-MDA-435S, BT474_cells 1 HT29_cells 

LNCaP.cefc, Lu.SC.H345, EB.cells' 

HMEC, HS578T.ceI!s, HT29.ce!ls 

HS578T.celis, Lu.LC.H460, U.SCJI69 

MB231_ce!ls l Lu_AD_H23, RPWEJ 

HMEC, Lu_AD_H23, HMEC (total RNA) 

HS578T.ceils, EB.ceDs, MCF7 

EB.cells, Fibroblasts 2, HMEC (total RNA) 

Fibroblasts 2, PRSC.con. DU145 cells 

EB.cells, Lu_AD_H23, Lu.SC.H69 

hiJtoJW, HMEC (total RNA), MB-MDA435s 

2.75 HWEC, HMEC (total RNA), Lu.SC.tf69 

HS578T_ceQs, HMEC, MB-MDA453 

Fibroblasts 2, HMEC (total RNA), MB-MDA-435S 

LuJ_C_H460, 293T_ceib, EG cells 

HMEC (total RNA), HMEC, Fibroblasts 2 

HMEC, EB.cells, HMEC (tote! RNA) 

2.71 Lu.SCJi345, Lu.SC.H69, 293T.celb 

EB.cells, HMEC, HMEC (total RNA) 

Caco2, Fibroblasts 2, MB-MDA435s 

HMEC, HS578T.celIs, HMEC (total RNA) 

HMEC (total RNA), Fibroblasts 2, PRSC con 

MB-MDA-435S, HS578T.cetls, Lu.SC_H69 

HS578T_ceBs, EB.cells, PRSC.con 

HMEC. HS578T_cel!s, PRSC.con 

HMEC (total RNA), HMEC, EB.cells 

HS578T cells, HMEC, MB231_celfs 

DU145 cells, HS57BT.cells, MB-MDA435s 

HS578T_ce8s, EB.cete, 293T_cefls 

HMEC (total RNA), HMEC, EB.cells 

HMEC (total RNA), HMEC. Fibroblasts 2 

Ceco2, MB-MDA-453, A549.cells 

Lu.LC.H460, Lu.SC.H69, MB-MDA-435S 

HMEC, HMEC (total RNA), EB.cells 

HS578T.ceDs, EB.cells, HT29.ceDs 

HMEC, HMEC (total RNA), Lu_SC_H69 

HMEC, HT29.ceDs, Lu_LC.H460 

HMEC, HMEC (total RNA), Fibroblasts 2 

HMEC (total RNA), EB.cells, BT474_cells 

HMEC, HS578T.C6HS, MB231.ceDs 

254 HMEC (total RNA), HMEC, OVCARjods 
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134764 M74715 Hs.89560 (duranWase; alphal- 2.54 

113404 T82323 Hs.70337 immunoglobulin superfamliy; member 4 2.54 

129128 AM23854 Hs.108812 ESTs £54 

101428 M1S684 Hs. 184929 protease inhibitor 1 (a^pte-1-antitrypsI 2.54 

103206 X72755 Hs.77367 monokine Induced by gamma interferon 2.53 

' 132273 AM89716 Hs.43658 DKFZP58BL151 protein 2.53 
108392 AA075124 zm86a1.s1 Stratagene ovarian cancer (#93 

IMAGE:5447763,mRNAseq 2.52 

119508 W37895 Hs.45519 ESTs 252 

109828 F13763 Hs.19827 ESTs 2.52 

135096 N89775 Hs.132390 zinc finger protein 36 (KOX 18) 2.51 

130860 U66061 Hs^41395 protease; serine; 1 fbypsin 1) 251 

105725 AA292228 Hs.199791 STAT Induced STAT inhibitor 3 2.51 

110427 H48579 Hs.36275 EST 251 

123762 AA610013 Hs.244553 EST 251 
126406 AAO34096 zf06fi05.rt Soares_fetaLiiver_spleen_1NF 

IMAGE:430017 5\ mRNA seq. 25 

129751 AA346065 Hs.111286 K1AA0714 protein 25 

121704 AA418743 Hs38306 ESTs 25 

112595 R77783 Hs,22404 protease; serine; 12 (neurotrypsln; moto 25 
108499 AA083103 zn1b12*1 Stratagene hNT neuron (#937233 

IMAGE:54773\ mRNA seq 25 

131968 AA151333 Hs56029 ESTs; Highly similar to basic helix-loop 25 

112665 R85661 Hs.221447 ESTs 2.46 

115764 AA421562 Hs.91011 anterior gradient 2 (Xenepus laevis) horn 2.48 

105959 AA405540 HsJ001 ESTs 2.48 

125804 R79519 Hs.16899 ESTs 2.48 

110102 H16681 Hs.180950 guanine nucteotlde btnding protein (Gpr 2.46 

104680 AA009809 Hs57599 ESTs 2.46 

132339 080030 Hs.45127 chondroitin sulfate proteoglycan 5 {neur 2.45 

121712 AA419116 Hs.193663 ESTs; Weakly similar to I!!! ALU SUBFAMI 2.45 

129226 M96843 Hs.180919 inhibitor of DNA binding 2; dominant neg 2.44 

128731 AF005271 Hs.104555 neuropeptide FF-amkte peptide precursor 243 

106670 AA461174 Hs5943 ESTs 2.43 

119306 T26914 Hs.132785 EAP30 subunit of ELL complex 2.43 

133507 X74295 Hs.74369 integrin; alpha 7 2.42 

125713 AA367905 Hs.77356 transferrin receptor (p90;CD71) 2.41 

107438 W27841 Hs.17118 ESTs; Weakly similar to B0025.2 [C.elega 2.41 

101784 M83186 Hs. 114346 cytochrome c oxidase subunit Vila polype 2.41 

134578 AA194724 Hs.182418 endonuctease G - 2.4 

125105 T95642 Hs.189759 ESTs 2.4 

127087 AA380418 Hs58012 SHP2 interacting transmembrane adaptor 2.4 

113118 T47906 Hs.220512 ESTs 259 

104791 AA029046 Hs.30377 ESTs; Moderately similar to cAMP inducfl) 259 

115833 AA428269 Hs. 125035 ESTs 258 

132223 R77451 Hs.4245 ESTs; Weakly simitar to similar to S. ce 258 

115836 AA428863 Hs.89388 ESTs 258 

101891 S45630 Hs.1940 crystafJrn; alpha B 258 

132894 D82422 H$5944 ESTs . 257 

106939 AA496048 Hs£6570 ESTs 255 

131104 W27770 Hs.258721 ESTs 255 

122355 AA443789 Hs.189324 ESTs 254 
119343 TB2873 yc3d2s1 Stratagene lung (#93721 ) H sapi 

to contains Aiu repetitive element, mR 254 

115442 AA284722 Hs.89121 H sapiens mRNA; chromosome 1 specific tr 253 

134286 T69384 Hs.68398 period (Drosophila) homolog 1 253 

125465 AI375276 Hs.158732 ESTs 253 

127449 AI421866 Hs.75722 rfcophorinll 233 

110225 H23927 Hs222381 ESTs 253 

119930 W86471 Hs.151624 hypocrefci (orexin) receptor 2 252 

125958 AI073357 Hs.12311 H sapiens done 23570 mRNA seq 232 

119746 W70279 Hs.221189 ESTs;WeaJdysiriiilarto1541YDROXYPROSTA 

108874 AA134112 Hs.107187 H sapiens DNA seq from cosmid ICK0721Q o 

L12UKE protein in an intron of the HS 232 

127368 AA434362 Hs.193326 ESTs 232 

120437 AA243427 Hs. 104311 ESTs 252 

119867 W60852 Hs250695 KDEL{Lys-Asr>OIu-Leu) endoplasmic retic 252 

131205 J02947 Hs.2420 superoxide dismutase 3; extracellular 2.32 

133710 X76057 Hs.75694 mannose phosphate isomerase 251 

104834 AA039331 Hs. 16323 ESTs; Weakly similar to GAGE-7 [Hapten • 251 

113186 T56048 Hs.189674 ESTs 251 

113462 T86826 Hs.142528 ESTs 251 

104743 AA021157 Hs.33619 ESTs 25 

129667 Y00097 Hs.1 18796 annexinA6 25 

111573 R10305 Hs.185683 ESTs 25 

117523 N32626 Hs.145532 ESTs; Weakly similar to Gag poiyprotein 2.29 



BT474.ceIis, PRSCcon, HT29_oelJs 
Caco2, KS578T_celb, HMEC 
BT474.cells, MB-MDA435S, HMEC 
HMEC, HT29_cefls, HMEC (total RNA) 
Fibroblasts 2, MB231.cei(s, HMEC (total RNA) 
EB_ceiis, HMEC, HMEC (total RNA) 

HMEC (total RNA), HMEC, HS578T_cells 
Lu.SC.H69, CALU6„ceUs, 293T_cells 
PRSCJog, PRSC.con, HS578T.celrs 
HMEC, HS578T_ceHs, HT29_oeBs 
OVCAR.ceQs, MB231_celis J PC3.cells 
HS578T_ceils, HT29.ce0s, HMEC 
HS578T.ceBs, Caco2, Lu.LC_H460 
HMEC (total RNA), HMEC, Fibroblasts 2 

LuJtfLH23, HSoTST.ceBs, LuJU)_358 
HMEC, HS578T_cefe, Fibroblasts 2 
EB.ceBs, HMEC (total RNA), HMEC 
Fibroblasts 2, EB.ceBs, PRSCjoon 

LNCaP.oelis, MB-MOA-453, HMEC 
Fibroblasts 2, A549_ceils, 293T.ceils 
Lu_AD_H23, HMEC, Lu_LC_H460 
EB.ceBs, Caco2, MCF7 
OVCAR^cells, BT474_cells, Caco2 
HMEC (total RNA), EB^ceJJs, HMEC 
HS578T_ceQs, HMEC, OVCARjcells 
HMEC, HS578T_ceils, Caco2 
OVCAR_ceife, 293T.ce&, HMEC (total RNA) 
Lu.SQ_H520, UU\D_H23, Lu.SC.H69 
MB-MDM53, 293T_ceUs, Caco2 
HMEC, HMEC (total RNA), EB.cells 
EB.cells, HS578T_cefe. Lu_SC_H69 
EB.cefls, HMEC (total RNA), HMEC 
Fibroblasts % Caco2, EB.cells 
HS578TJSIIS, Fibroblasts 2, Lu_AD_H23 
HMEC, HS578T.ceDs, MB231_ceIls 
Fibroblasts 2, PRSC.con, PRSCJog 
EB.cefls, HMEC, LlAD_H23 
EB.celis, A549_celb, HS57ffLceils 
HMEC, HMEC (total RNA), EB.cells 
MB-MDA435S, HS578T.ceHs, HMEC 
LNCaP.cefe, OVCARj»Ils, PC3.ceiis 
Caco2, WCaP.ceOs, CALU6_celb 
HMEC, HMEC (total RNA), EB.celis 
HS578T.oeBs, HMEC, PRSC.con 
HS578T.ceils, OVCAR_cells, Lu_LC.H460 
Caco2, M&WDA-453. HT29.celis 
LNCaP.cefls, JB3T.cells, EB.cete 
HMEC (total RNA), HMEC, HT29.cells 
HMEC (total RNA), HMEC, EB.cells 

HS578T_cetls, Lu_SC_H69, HT29_ceils 
LilAD_H23, HMEC (total RNA), BT474.ceHs 
HMEC, HMEC (total RNA), MB231_cete 
HMEC (total RNA), EB.cells, HMEC 
UjAD_H23, HMEC (total RNA), HMEC 
HS578T_cells, HMEC, Lu_LC.H460 
HMEC, HMEC (total RNA), EB.cells 
MB231.cells, HMEC (total RNA), HMEC 
2.32 HMEC, HS578T.ceHs, MB231.ceils 

CaccZ PRSC.con, LMCaP.cells 
HMEC (total RNA), HSbTST.celis, HMEC 
HMEC (total RNA), HMEC, MB-MDA-435S 
Fibroblasts 2, HS578T.cells, M&MDA435s 
PRSC.con, EB.cells, Lujtf)_358 
293TjsUs, LNCaP.cells, RPWEJ2 
Caco2, HS578T.ce!ls, HMEC 
HMEC, Fibroblasts 2, HMEC (total RNA) 
PC3_cefls, HS578T_cefls, HMEC 
HMEC (total RNA), HMEC, OVCAR.cslls 
PRSCJog, PRSC.con, HS578T.ce0s 
HMEC, HMEC (total RNA), EBjslls 
EB.celis, Fibroblasts 2, HS578T_ceRs 
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115540 AA349954 Hs.56281 ESTs; Weakly similar to ASB-1 protein [H 229 
101622 M55621 Hs.151513 mannosyl (alpha-1;3-^gr/coprotein beta- 229 
103535 Y13620 Hs.122607 B<eOCLLflymphoma9 228 
127482 AI337294 Hs.155014 ESTs Z28 
104297 D31111 Hs,106O05 ESTs; Highly similar to NY-REN-50 antige 227 
112318 R55470 Hs.11087 ESTs 227 
101877 M97496 Hs.778 guanytate cyclase activator 1B (retina) 227 
100760 H63576 Major Hfetocompatibttjty Complex, Class Z26 

102362 U39412 Hs.75932 N^thylmalelmld^sansftwe factor attach 226 
106142 AA424590 Hs.239631 Golgl transport complex protein (90 kDa) 2.26 
101461 M22430 Hs.76422 phosphoGpase A2; group HA (platelets; 226 
119336 T55340 Hs.208238 ESTs 226 
127619 AA627122 Hs.163787 ESTs 225 
104113 AA427510 Hs.181202 ESTs; WeaJdy similar to Wiscott-AIdiich 2.25 

131219 C00476 Hs24395 small inducible cytokine subfamily B (Cy 225 

118915 N91481 Hs.54713 ESTs 225 

127556 AA679831 Hs.190228 ESTs 224 

128700 U59286 Hs.103982 small Inducible cytokine subfamily B (Cy 224 

113674 T96374 H&5753 inositol(myo}-1 (or ^phosphatase 2 224 

133085 M73720 Hs.646 cartxsxypeptictasa A3 (mastceO) 224 

106017 AA411882 Hs26268 ESTs 224 

100582 HG2348 Peptide Yy 224 

134811 N66357 HsJ9761 APsynmase;H+transpc^g;rnitoc)iond 223 

102543 U57627 Hs.234776 oculocerebrorenal syndrome of Lowe 223 
127357 AM52788 zx39g11.r1 SoaresJotelfetusJJb2HF8_9w 

IMAGE:7889W)5\mRNAseq. 2.23 

135288 AA402930 Hs.97876 ESTs 223 

105581 AA278850 Hs28891 ESTs;WeaWysirnilartoiiHALUSUBFAMl 223 

103812 AA137107 Hs.124094 ESTs; Weakly similar to NFAT1-A (Mmuscu 2.23 

117016 H87171 Hs.52170 ESTs 222 

114607 AA079342 Hs.129057 breast carcinoma amplified seq 1 222 

134000 U29091 Hs.7833 selenium binding protein 1 222 

111069 N58461 Hs22036 ESTs 222 

129048 L27670 Hs.108287 intercellular axttieston molecute 4; Lands 221 

124995 T52700 Hs.110044 ESTs 22 

116678 F05063 Hs251736 ESTs 22 

118222 NS2263 Hs.48501 EST 22 

127888 AI149662 Hs.143590 ESTs 2.19 

113790 W33178 Hs26912 ESTs 219 
100097 AF002224 H sapiens Angelman Syndrome Gene, E6-AP 

from promoter P1 , 5TJTR 2.19 

109151 AA176800 Hs.73452 ESTs 219 

135368 AA086057 HsS964 ribosomal protein; mitochondrial; S12 2.19 

109016 AA156936 Hs.58069 ESTs; Highly similar to type II cAMP-dep 2.19 

124300 H92575 Hs.105959 ESTs; Weakly simflar to (III ALU SUBFAM1 2.16 

123450 AA598913 Hs.111207 ESTs 218 

117435 N27628 yw50b08.s1 Weizmann Olfactory Epithelium 218 

119860 W80709 Hs.58485 ESTs 2.18 

123833 AA620717 Hs.112889 ESTs 218 

107938 AA029446 Hs53115 ESTs 2.17 

119380 T83659 Hs.184407 ESTs 2.16 

114066 Z38152 Hs.26920 ESTs 2.15 

128748 T59001 Hs.10475 ESTs 215 

130414 M21 121 HS241392 small inducible cytokine A5(RANTES) 2.15 

123490 AA599723 TAP binding protein (tapasin) 215 

112588 R77302 Hs20226 ESTs • 214 

110548 H58715 Hs.14706 ESTs 2.14 

101581 M34996 Hs.19B253 ma]or rustocompaMy complex; ctess 114 

115248 AA278887 Hs.194530 ESTs; Weakly simSar to unknown IKsapie 214 

105619 AA280810 Ks24003 ESTs; Moderately similar to LEYDIG CELL 2.14 

128058 AI126617 Hs.132449 ESTs 214 
134573 AA442125 Hs.171873 ESTs; Weakly similar to PUTATIVE STER01DZ14 

134863 AA353903 Hs.183373 ATX1 (antioxidant protein 1; yeast) homo 2.14 

128811 H17317 Hs.169100 ESTs; Weakly similar to HPBRI1-7 protein 2.13 

112368 R59371 Hs.26653 EST 2.13 
108395 AA075144 zm86f6.s1 Stratagene ovarian cancer (#93 

gb:X1664 TRANSLATION ALLY CONTROLLED TUM 

129611 D45680 Hs.11614 ESTs 213 

101253 L34355 Hs.99931 sarcoglycan; alpha (50kD dystrophln-asso 212 

126701 AA515212 Hs2Q2590 ESTs; Weakly similar to mucin glycoprote Z12 

111628 R15825 Hs.4014 KIAA0946 protein; Huntinglin interacting 2.12 

108675 AA115240 Hs.61816 ESTs 2.12 

127131 Z44658 Hs.105460 DKFZP564O0823 protein 212 

109590 F02465 Hs27281 ESTs 212 

116539 D12124 Hs242890 EST 212 

112117 R45402 Hs23789 ESTs 2.12 



Rbroblasts2, BT474_ce0s, MB231jdbIIs 
PRSC.con, RPWE.2, PRSCJog 
lu_SC_H69, UtAD.358, U_AD_H23 
HS578T_cells, 293T_celis, CALU6_ceUs 
EB_celIs, DU145_ceHs, HT29_cefls 
MB4/DA-453, LNCaP.cells, OVCAR_cefls 
HT29_ceBs, BT474_Getls, Caco2 
MB-MDA-435S, MB231_ceBs, BT474.ceJIs 
LNCaP.celis, MB-MDA-453, Caco2 
HMEC, HS578T_cells, Caco2 
LNCaP.cells, BT474.ce0s, Caco2 
HS578T_ceI!s, EB.cells, HMEC 
LLSCLH520, Lu_LC_H460, Ul.SC.H69 
N©-MDA435s, Fibroblasts 2, HMEC (total RNA) 
Lu_SCLH520, BT474_cefls, Fibroblasts 2 
HMEC (total RNA), HMEC, MCF7 
HS578T_ceUs, EB.cells, HMEC 
HMEC, HS578TceRs, Fibroblasts 2 
A549_ceUs, DU145_ceUs, Lu_AD_358 
HS578T_ceUs, Fibroblasts 2, HT29_ceIls 
MB-MDA-453, OVCARjslis, 293T.ceIls 
HMEC, HS578T,ceIIs I HMEC (total RNA) 
Lu.SQ.H520, LNCaP.cells, LulAD_H23 
293T_cells f EB.cells, LNCaP.cells 

HS578T.cells, RPW^ HMEC (total RNA) 
HS578T.cells, 293T_cells, OVCAR.celIs 
BT474_cel)s, BT474.ceIIs, MB231.ceSs 
Lu_SC_H345, Lu.AD.H23. PRSC.con 
Fibroblasts 2, LuJ.C_H460, HMEC (total RNA) 
BT474.ceUs, HT29.cells, HT29 cells 
LNCaP.cells, MB-MDA-453, BT474.ceIls 
HMEC, Lu.SC.H345, HS578T.ce0s 
LuJUD_H23, HS578T.cells, Lu.SQ.H520 
Caoo2, M&M3A453, HT29_cefc 
HS578T.cells, BT474_ceUs, 293T.ceils 
HS578T.cells, BT474.ce(ls, MB231.cefls 
BT474.ceUs, CALU6.cells, MB231.ce0s 
HMEC, HMEC (total RNA), Fibroblasts 2 

HS578T_celis, CALU6_ceiis, 293T_cete 
CALU6.cefls, LuJU).H23, Lu.SC.H69 
OVCAR.cefls, A549_ceDs, Lu_AD_H23 
HS578T_ceIls, BT474.cefls, A549 cells 
IUJD 358, Lu.SC.H69, Lu_SC_H345 
HMEC (total RNA), HMEC, MB-MDA-435S 
UiCaP.cells, DU145.celis, Lu.SQ.H520 
HS578T.ceIis, MB231.ceUs, Caco2 
UUS)jm, Lu.SQ.H520, Lu_AD_358 
Caoo2, 293T_ceUs, 293T.oells 
LilADJ^, Lu^AD.358, PRSC.con 
HMEC (total RNA), HMEC, EB.celte 
HMEC, HT29.cells, MB231_ceHs 
HS578T_cefe, PCS.cefc, A549.cells 
HS578T.cells, EB.cells, Lu_SCJ<69 
HMEC (total RNA), HMEC. Fibroblasts 2 
HMEC, HMEC (total RNA), HT29.ceDs 
MB-MDA435S, HMEC, HMEC 
HT29.cells f BT474_cefls. CALU6_celIs 
LU.SCLH520, MB-MDA435s, LNCaP.cells 
HS578T_cells, EB.cells, HMEC (total RNA) 
EB.cells, MB231.ceUs, Caco2 
Lu.SC.H345, HT29.oeDs, BT474_ceils 
Caco2, Lu.SC.H345, EB.cells 
HMEC, HMEC (total RNA), Lu_SCLH520 

2.13 HMEC (total RNA), HMEC. OVCAR_calts 
HMEC, HS578T_cells, Caco2 
HS578T_ceIls, OVCAR.cells, CALU6.cells 
EB.cells, Lu.AD.H23, LuJU)_H23 
A549_ceBs, BT474.celIs l MB-MDA435s 
LuJU)_H23 ( MB-MDA453, PRSC con 
EB.cells, Lu.SC.H69, Lu.SC H69 
HMEC, HS578T.ce«s, HMEC (total RNA) 
U_AD_H23, Caco2, BT474.cells 
EB.cells, Lu_AD.H23, LU.SCLH520 
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126367 AA477929 Hs^5584 ESTs 112 
135252 U62966 Hs.97207 solute carrier family 28 (sodiuriKCupfed 2.11 
117565 N34301 Hs.248426 EST 111 
129430 AA258B42 Hs.197877 H sapiens done 23777 putative transmemb 111 
120256 AA169801 sema domain; Immunoglobulin domain (Ig); 2.11 

1341© D20342 Hs.178137 transducer of ERBB2;1 (TOB1) 2.11 
130397 AA487452 Hs.155344 DNA fragmentation factor; 45 kD; alpha s 2.11 
132859 020925 Hs.5842 ESTs 2.11 
117633 N36404 Hs.44807 ESTs 2.11 
125003 T59442 Hs. 100445 ESTs 2.11 
125329 AA825437 Hs.58875 ESTs 2.11 
114065 Z38149 Hs.134015 uronyl 2^fotransferase 2.11 

120718 AA292747 Hs.97296 ESTs 2.11 

133869 T49444 Hs.77031 Sp2 transcription factor 2.1 

135351 AA430179 Hs.9933 putative A(>fike transpose 11 

110973 N51529 Hs.118047 ESTs 2.09 

131879 AA017161 Hs.33792 ESTs 2.09 

116656 F03935 Hs.241640 EST 2.09 
120311 AA194074 Hs.193401 ESTs 109 
108024 AA040433 Hs.61898 DKFZP586N2124 protein 2.09 
105671 AA399633 Hs.24872 ESTs 109 
120206 Z40805 Hs.91668 ESTs 2.09 
112333 R56222 Hs.26514 ESTs 2.09 
116746 H04811 Hs.79027 ESTs 108 
121529 AA412257 Hs.98121 ESTs 108 
105592 AA279337 Hs.180549 ESTs; Highly similar to R26660J; parfia 2.08 
108582 AA088231 Hs.91732 ESTs 2.08 
123197 AA489250 Hs. 59403 serine palmitoyltransferase; subunit II 2.08 
134965 J05480 Hs.92 protein phosphatase 3 (formerly 2B); cat 108 
123856 AA620814 Hs.144959 ESTs 2.08 
132058 AA251737 Hs.172818 Apg12 (autophagy 12; S. cerevtstaeHfee 2.07 
126476 R94666 Hs.195155 ESTs; Weakly similar to transporter prat 2.07 
106087 M418740 Hs.21111 ESTs 107 
103802 AA122003 Hs.62954 ferritin; heavy polypeptide 1 2.07 
125633 AA908225 Hs.126841 ESTs 2.07 
112817 R98491 Hs.14584 ESTs 2.07 
111050 N56984 Hs.74335 heat shock 90kD protein 1 ; beta 2.07 
133072 AA425294 Hs.64322 ESTs; Weakly similar to Closely related 2.07 
118270 N62868 Hs.48653 ESTs 2.07 
105035 AA128486 Hs.8859 ESTs 2.07 
102337 U36922 Human fork head domain protein (FKHR) mR 2.07 
109687 F09380 Hs.182859 Hfeguard 106 
109802 F10789 Hs.12439 ESTs 2.06 
128103 AA905960 Hs.48516 ESTs 2.06 
128278 AI018343 Hs.131275 ESTs 2.06 
131873 H39997 Hs^3716 ESTs 2.06 
122683 AA455528 Hs.96772 ESTs 2.05 
128066 AA884838 Hs.189171 ESTs 105 
131451 N28028 Hs.26968 H saptens mRNA from chrwrosome 5q21-22; 2.05 
120887 AA365644 Hs.97043 ESTs 2.05 
103966 AA303166 Hs.127270 ESTs 2.05 
105861 AA399260 Hs^8454 ESTs 105 
104627 AA001976 Hs.19603 ESTs 105 
108794 AA129468 Hs.203392 ESTs 2.04 
111896 R38936 Hs.24894 H sapiens clone 25248 mRNA seq 2.04 
101849 M94167 Hs.172816 neureguiinl 104 
119913 W85931 Hs.58785 ESTs 2.04 
130785 AA242826 Hs.19405 caspase recruitment domain 4 204 
124702 R06984 Hs.7745 ESTs; Weakly similar to TEST1S-SPECIFIC 103 
106769 AA478001 Hs.225935 dracylglycerol Oacyttransferase (mouse) 103 
132219 N48682 Hs.172971 ESTs 2.03 
122033 AA431334 Hs.109297 ESTs 2.03 
120461 AA251301 zs10b02.s1 NCLCGAPJ3CB1 H sapiens cDNA 

contains Atu repetitive element, mRNA 2.03 

134959 U90550 Hs.91813 butyrophiHn; subfamily 2; member A2 2.03 

104909 AA055892 Hs.14543 ESTs 103 

101950 S79219 Hs.80741 propbnyi Coenzyme A carboxylase; alpha 2.03 

133878 D78947 Hs.7718 ESTs; Weakly similar to weak similarity 102 

103459 X99894 Hs.32938 insulin promoter factor 1; homeodomain t 102 

125507 A1436377 Hs.258590 tetraspanin TM4-B 2,02 

116657 F04014 Hs.65996 ESTs 101 
112920 T10234 Hs.4275 ESTs 101 
105533 AA258572 Hs.6418 ESTs; Moderately similar to seven transm 101 
126762 AA064671 zm13b04.r1 Stratagene pancreas (#937208) 

simllartoTR:G413842 G413842 NONCLASSI 

128999 R37808 Hs.107765 ESTs 101 



Lu.SC_H69, LiLAD_H23, Lu_AD_358 
MB-MDA-435S, 293T_cells, CALU6_cells 
HMEC, HS578T_ce!ls, MB231.cells 
HS578T.cefls, Lu_AD_358 ( MB-MDA-435S 
HMEC, HMEC (total RNA), EB.cells 
HMEC (total RNA), 293T_cells, OVCARjslis 
293T_cefls, Caco2, LuJVD.H23 
HMEC (total RNA), Fibroblasts 2, HMEC 
HMEC, Caco2, HS578T.ce0s 
M&M)A-435s, HMEC (total RNA), HT29_cells 
HS578T.ceii3, PRSC.con, PRSCJog 
MB-MDA435S, 293T_ceUs, PRSC.con 
H729.ceBs, Lu_AD_H23, Lu.SC.H69 
Lu.LC.H460, UlAD.358, RPWEJ 
HS578T.ce0s, EB.ceBs, HMEC 
EBLcells, HS578T.ceDs, MCF7 
HMEC (total RNA), MB231.cefe, BT474 cells 
HS578T_calls p UUC.H460, Lu_SC.H69 
OVCAR.ceHs, HMEC (total RNA), HMEC 
HMEC (total RNA), BT474_cells, HT29_cells 
Fibroblasts! A549_celis. HS578T.cells 
BT474.cells, MB-MDA-453, EB.cells 
UlAD_H23, RbroWasisl UJLCJW60 
MB-MDA-435S, HMEC (total RNA), UlSC_jH345 
HMEC, HMEC (total RNA), HS578T_ceHs 
LNCaP.cefe, PRSCJog, PRSC.bg 
HS578T_ceils, Lu_SC_H345, Lu_SC_H69 
EB.cells, Lu.SC.H69, Lu.SC.H345 
LNCaP.ceOs, MB-MDA-435S, HMEC 
HS578T.cefls, BT474.ceBs, BT474_oefls 
HS578T.cefis, MCF7, HMEC 
PRSCJog, LUCJ-M60, RPWEJ. 
OVCAFLcefe, A549_ceUs, LuJU).H23 
HMEC, HMEC (total RNA), HS578T.ceils 
EB.cells, Fibroblasts! Lu.SC.H69 
HMEC. HMEC (total RNA), Ffcroblasts2 
LNCaP.ceBs, DU145_celIs, 293T_ceDs 
LNCaP.ceBs, MB-MDA453, Caco2 
HMEC (total RNA), HMEC, EB.cells 
LNCaP.cells, PC3_ceils, EB.cells 
293T_ceHs, HMEC, HT29_cells 
BT474_ceIls 1 BT474_ceDs, LilAD_H23 
EB cells, EB cells, Caco2 
HT29_ceIis, HMEC (total RNA), HMEC 
PRSC.con, U.SCJH345, HS57ST_cel!s 
HMEC (total RNA), HMEC, EB.cells 
LNCaP_ceb, UlAD.H23, HS578T_cells 
HMEC, HMEC (total RNA), Fibroblasts 2 
MB-MDA-435S, LUJ.CJW60, Lu_SCLH520 
HS578T.oetls, PRSC.con, HMEC 
HMEC (total RNA), LNCaP.cells, PC3_ceHs 
Fibroblasts! HMEC (total RNA), EB.ceOs 
HS578T_cells, HMEC, BT474_ceiis 
HS578T.celis, HMEC, A549.ceDs 
HS578T.cete, PC3.C8IIS, 293T.cefls 
HMEC. HS578T_ceBs, HMEC (total RNA) 
HMEC, BT474_ce0s I MB231_cdls 
HMEC, HS578T_cefls, BT474.ceHs 
Fibroblasts 2, PRSC.con, HMEC 
PC3_ceUs, EB.ceBs, HS578T_cells 
HT29.cete, PC3.cells, A549.cells 
OVCAR.ce0s, A549_ceBs, Caco2 

HS578T_cefe, EB.cells, EB.cells 
HMEC, Fibroblasts 2, EB.cells 
Lu.SC.H345, PC3_ceBs, DU145.cells 
Lu.SC.H69, EB.cells, CALU6_cel!s 
EB.cells, MCF7, MB231.cells 
EB.ceOs, LLADJ123, LlAD.358 
A549_ceHs, LU.SCLH520, Lu_AD.H23 
HS578T_cel!s, HMEC, MB231.ce0s 
HS578T_ceIls, EB.cells, PRSC.con 
HS578T_ceBs, HMEC, EB.cells 

101 RPWEJ, LuJVD.H23, IolAD.358 
HS578T_cens, OVCAR.cells, EB.ceBs 
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Table 2 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigcncID: Unigene number 

Unigene Title: Unigene gene title 



ExAccn UniGJD CompleteJTtle 



Ratio Mets/BS Top 3 expressing cell Ones 



101447 
105039 
106094 
105777 
12981B 
118475 
112170 
114918 
104590 
120825 
115650 
124568 
134238 
114721 
106145 
114610 

130281 
124690 
113490 
104425 
118828 
129076 
109684 
104558 

109032 
129350 
112662 
132902 
126872 
122528 
102193 
121332 
135357 
109141 
135324 
124875 
102380 
127956 
125038 
102515 
109027 
115096 
123470 
113219 
123433 
1351B2 
121721 
129126 
135232 
124847 
110349 
134402 
115494 
119174 



M21305 

AA130349 

AA419461 

AA348412 

N54841 

N66845 

R48744 

AA236813 

R79750 

AA285053 

AA404564 

N67086 

R81509 

AA131450 

AA424791 

AA081079 

R12777 

R05818 

T88700 

H88496 

N79496 

AA262179 

F09317 

R56678 

AA158234 



R85436 



AA1! 
AA449804 
U20758 
AA404384 
AA235803 
AA176428 
AA082041 
R70506 
U40434 
AA826117 
T78Q89 
U52696 
AA157818 
AA255991 
AA599106 
T59257 
AA598661 
M28170 
AA419470 
H88486 
AA342457 
R60044 



U25165 

AA290603 

R71234 



121943 AA429265 
110856 N33063 
102474 U49973 
123458 AA598963 
116459 AA621399 
126301 N62371 
123461 



Hs.247946 

Hs^6475 

Hs.18127 

Hs.23096 

Hs.172572 

Hs.165411 

Hs.192878 

Hs.72324 

Hs.63523 

Hs.107168 

Hs.47094 

Hs.102000 

Hs.184571 

Hs.103822 

Hs.5734 



Hs.15395 

Hs.173830 

Hs.173374 

Hs.40583 

Hs.50824 

Hs.169343 

Hs.140885 

H&88959 

Hs.72222 
Hs.110630 
Hs.193150 
Hs.168147 

Hs.250992 

Hs.313 

Hs.97921 

Hs.79572 

Hs.193360 

Hs.9873 

Hs307693 

Hs.155981 

Hs.194013 

Hs.168887 

Hs.238380 

Hs.175319 

Hs.194208 

Hs.194407 

Hs.1 12478 

Hs.96023 

Hs.199961 

Hs. 108806 

Hs.96600 

Hs.106706 

Hs.82712 
Hs.256517 



HS.1S759 
Hs.23291 

Hs.1 12499 
Hs.64193 
Hs.100043 
Hs.251119 



Human alpha satelfite and satellite 3 ju 110.98 

ESTs 9.13 

ESTs 8.51 

ESTs 8.4 

ESTs 7.2 

ESTs; Weakly similar to ALU CLASS B 7 

ESTs 6.91 

ESTs; Highly similar to unknown [H^apie 6.6 

nuclear receptor subfamily 1; group I; m 6.58 

ESTs 6.55 

ESTs 6.43 

ESTs 6.35 

spSctng {acton anginine/sertne-rfch 1 1 6.32 

ESTs 6.13 

KIAA0679 protein 6 
zn32h9.s1 Strafegene endoflieiial cell 93 

IMAGE:5491 85 3\ mRNA seq 5.97 
ESTs; Weakly similar to ARGINYL-TRNA SYN 

ESTs 5.92 

ESTs 5.81 

ESTs 5.77 

EST 5.45 

ESTs 5.35 

ESTs; Weakly similar to UNE-1 REVERSE T 5.34 
Human DNA seq from clone 967N21 on chr 2 

part of KJAA0172; the gene for a novel 5.32 

ESTs 5.23 

Human BRCA2 region; mRNA seq CG006 5.2 

ESTs 5.2 

ESTs 5.16 

ESTs 5.04 

EST 5.04 

secreted phosphoproiein 1 (osteopontin; 5.02 

ESTs 5.01 

cathepsin D (lysosomal aspartyl protease 4.96 

ESTs 4.86 

ESTs 4.83 

ESTs; Weakly simBar to (ill ALU SUBFAMI 4.75 

mesothelin 4.71 

ESTs 4.69 

ESTs 43 

Humn adrenal Creb-rp hmlg (Creb-rp), com 4,57 

Human endogenous retroviral protease mRN 4.57 



ESTs 
ESTs 
ESTs 
ESTs 

CD19 antigen 

ESTs 

ESTs 

ESTs; Moderately similar to III! ALU SUB 
ESTs; Highly similar to BETA-CATENIN [H. 
ESTs; Weakly similar to lilt ALU SUBFAMI 
fragile X mental retardation; autosomal 
ESTs 

yi54c08.s1 Scares placenta Nb2HP H saple 
transcript (rRNA); gb:S41458 ROD CGMP- 
BETA-SUBUNIT (HUMAN);contain 
ESTs 

ESTs; Weakly similar to S164 [H^apiens] 
Human Tjggerl transposable element, comp 
KIAA0612 protein 
ESTs 

ESTs; Weakly similar to Similar to cutic 
EST 



4.57 
4.55 
4.55 
4.55 
4.53 
4.51 
4.45 
4.43 
4.42 
439 
4.38 
4.36 



433 

a 

4.28 
428 
4.27 
422 
422 
422 



EB.celis, Fibroblasts 2, A549_cefls 
EB.celis, OVCAR.celis, Lu.SC.H345 
HT29.cells, MB-MDA-453, HS578T.cells 
293Tjoefls t OVCAR_cells, EB_ceOs 
Lu_SC_H69, EB.celis, Lu.SC.H345 
DU145_ce2s, EB.celis, Caco2 
293T.ceils, DU145_ceife, HT29_cells 
EB.celis, 293T.cells, DU145_ceKs 
293T.celIs, OVCAFLcelJs, HMEC 
CALU6.ce0s, OVCAR_cells, EB__oeUs 
EB.celis, LNCaP.cells, Lu.SC.H345 
PC3.cells, A549_cells. DU145_ceJIs 
293T.cells, Lu_SC_H345, HMEC 
Caco2, MBWDA-435s, PRSCJog 
OVCAJLceils, EB.celis, 293T.ceiis 

PRSC.con, DU145.cete, HS578T.celis 
5.94 PRSC.con, HT29.cells, EB.celis 
LNCaP.cefls, EB.celis, OVCAR_cells 
DU145_obDs, PC3_ceBs. HMEC (total RNA) 
OVCAR_cefls, HS578T_cetls, A549.ceUs 
LNCaP.cells, OVCAR.oefls, DU145_ceBs 
293T.cel!s, BT474_ceDs, MCF7 
Fibroblasts 2, Lu_SC.H69 f DU145_ceIls 

EB cells, PC3.cells, Lu.SC_H345 
HT29.cefe r PC3_ceBs, UlAD.358 
293T cefis, EB cells, DU145 cells 
MB4rfbA435s,~PRSC.con, MB-MDA-453 
PC3.ceHs, LNCaP.cells, CALU6.cefls 
EB.celis, Fibroblasts 2, A549_ceils 
Lu.SC.H345, PRSC.con, LNCaP.cells 
Lu.LC.H460, A549.cells ( MMDA435s 
EB cells, Lu.SC.H69, DU145_ceDs 
EB cells, MCF7, DU145.ce«s 
DU145_ceDs, PC3.cells, PRSCJog 
EB.celis, Lu_SC_H345, HS578T.cells 
DU145.ceDs, OVCAR_cefls, LNCaP.cells 
OVCAFLcells, UiJDjm, RPWE.2 
EB cells, HS578T_cefls, DU145.cetls 
OVCAR.cete, 293T.ceBs, DU145_ce!ls 
Ui SC.H345, Lu.SC_H69, HT29_cells 
PC3_ce8s, EB.celis, Lu.SCLH520 
OVCAR_ceIls, 293T_ce!Is, PC3.ceHs 
LNCaP.ceOs, Lu.SC_H69, 293T_cells 
A549.ceBs, 293T.cefls ( 293T.celIs 
EB.celis, OVCAR.celfc, HT29.cells 
OVCAR_ceHs, DU145.cells, EB.celis 
DU145_cells, LNCaP.cells, EB.celis 
LNCaP.cells, Caco2, EB.celis 
LNCaP.cells, DU145.ceBs, OVCAR.cefls 
OVCAR.cefls, CALUe.cells, CALU6_ceDs 
DU145.cells, OVCARj*Us, LNCaP.cells 
HS578T.cells, OVCAFLceOs, DU145.cells 
Lu.SC_tf345, OVCAR.cells, PC3.ce»s 



DU145.ceIls, OVCAR.cefls, LNCaP.cells 
EB.celis, HT29_ceIls, Lu_SC_H69 
OVCARjslls, EB.celis, Lu.SC.H69 
DU145.ceBs, LNCaP.cells, OVCAfLcells 
A549.ceHs, A549_cells, BT474_cells 
Caco2, HS578T.ceIls, MB-MDA435s 
PC3 ceBs, DU145.ce!ls, Lu.SC.H345 
LU.SC-H345, Lu.SC.H69, OVCAFLceOs 
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130588 AA287735 Hs.16411 Human DNAseq from clone 1189B24 on chro 



MLRQ subunit (EC 16.53; EC 1.6.99.3; 

Tyrostoe-protein Kinase FER (EC 2.7.1 .1 4.2 

125756 W25498 Hs.81634 ATP synthase; H+ transporting; mtochond 4.2 

135009 AA040507 Hs2518B5 ESTs 4.19 

107001 AA598589 Hs.24492 ESTs 4.18 

124896 R82063 Hs.101594 EST 4.16 

1 19404 T92950 ye27c10.s1 Stratagene lung (#937210) H s 4.15 

125090 T91518 ye20K)5.s1 Stratagene lung (#937210) H 8 

contains Atu repetitive eterrentcontaln 4. 14 

117348 N24157 Hs.139615 ESTs 4.1 

111389 N95837 Hs.169111 ESTs; Weakly simitar to L82A [D.melanoga 4.1 

134977 AA464698 Hs.19390 ESTs; Weakly similar to bullous pemphigo 4.09 

124696 R06273 Hs.186467 ESTs; Moderately similar to (ill ALU SUB 4.09 

124090 H09570 Hs.143032 ESTs; Weakly similar to neuronal thread 3.98 

133992 R46354 Hs.169832 zinc finger protein 42 (myetoitepedfic 3.98 

126009 H51652 Hs.242985 hemoglobin; gamma 6 3.96 

114161 Z38904 Hs22385 ESTs; Weakly similar to KJAA0970 protein 3.94 

109171 AA1B0356 Hs.73700 EST 3.94 

122007 AA430629 Hs.98564 ESTs 3.93 

131936 AA094865 Hs.179972 interferon; alpha^nduclbte protein (do 3.9 

126668 AA194849 Hs.103422 ESTs 3.9 

124977 T33859 Hs.190452 K1AA0365 gene product 3.89 

107048 AA600012 Hs.10669 ESTs; Moderately similar to KIAA0400 [H, 3.89 

105358 AA236034 H&25362 ESTs 3.89 

135106 AA599037 Hs.9456 SWI/SNF related; matrix assccd; actin de 3.86 

106686 AA463215 Hs.29896 ESTs; Weakly similar to proGne-fich pro 3.85 

132093 AA400091 Hs.39421 ESTs 3.85 

128651 AA446990 Hs.103135 ESTs 3.84 

102459 U48936 Human amilondeHsensitive epithelial sod 3.84 

113732 T98288 Hs.193295 ESTs; Weakly similar to III) ALU SUBFAMI 3.82 

116000 AA448710 Hs.41327 ESTs 3.82 

120748 AA303153 Hs.237994 EST; Weakly similar to HI! ALU SUBFAM1L 3.82 

116318 AA490830 Hs.58570 deleted in cancer 1 ; RNA heHcase HDB/Dl 3.79 

114366 Z41747 Hs.469 succinate dehydrogenase complex; subunit 3.78 

107248 D59894 H&34782 ESTs 3.75 

132713 AA286906 Hs.55335 ESTs 3.75 

102222 U24683 Hs. 159386 Immunoglobulin heavy variable <W 3.73 

108201 AA057518 Hs.63394 ESTs 3.72 

119940 W86779 Hs.171807 DKFZP586B0319 protein 3.71 

106508 AA452590 Hs.30348 ESTs 3.67 

114360 Z41592 Hs.22129 hypothetical protein 3.67 

100991 J03764 Hs.82085 plasminogen activator tnhfcitor; type I 3.67 

107580 AA002091 Hs.175476 ESTs; Weakly similar to till ALU SUBFAMI 3.67 

111685 R21408 Hs.106095 ESTs 3.66 

128336 AI242720 Hs.146043 ESTs; Weakly simitar to alternatively sp 3.66 

130868 AA0Q4900 Hs.171917 ESTs; Weakly smir to smir to gfycerophos 3.61 

116802 H44061 Hs.194Q26 ESTs 3.6 

130753 Z46632 Hs.189 phosphodiesterase 4C; cAMP-specific (dun 3.6 

123074 AA485117 Hs.105653 ESTs 3.6 

114317 Z41038 Hs.469 succinate dehydrogenase complex; subunit 3.6 

134194 AA233231 Hs.79828 ESTs 3.59 

127752 AA808388 Hs.211167 ESTs 3.59 

123526 AA60B657 ESTs; Moderately similar to till ALU SUB 3.59 

127917 AA211895 Hs.118831 EST; Highly simSar to dJ1163J1.Z1 [Hs 3.58 

105941 AA404427 Hs.10669 ESTs; Moderately similar to KIAA0400 [H. 3.58 

124694 R06108 Hs.135258 ESTs 3.56 

105656 AA282571 Hs.203772 FSHD region gene 1 356 

111168 N66951 Hs2383B0 Human endogenous retroviral protease mRN 3.55 

133254 AA156670 Hs.180780 H sapiens agrin precursor mRNA; partial 3.54 

132640 U33821 Taxi (human T-cell leukemia virus type) 3.53 

116562 D25807 Hs.90145 ESTs 352 

126045 N80361 Hs.14248 ESTs 351 

122878 AA465341 Hs.99640 ESTs 3.47 

105220 AA210695 Hs.17212 ESTs 3.47 

127001 AA731636 Hs59319 ESTs; Weakly similar to 111! ALU SUBFAMI 3.45 

112693 R88741 Hs.91065 ESTs; Moderately similar to proltferatio 3.44 

104935 AA063280 Hs.35552 ESTs 3.43 

128710 J04813 Hs.1041 17 cytochrome P450; subfamily IIIA (niphedl 3.41 

131996 D86956 Hs.36927 heat shock 105kO 3.4 

1 19229 T03229 H sapiens (clone 104) retinoblastoma 1 g 3.4 

128046 AA873285 Hs.137947 ESTs 3.39 

105175 AA186804 Hs.25740 ESTs; Weakly similar to ubiquitous TPR m 3.39 

132349 Y00705 Hs.181266 serine protease Inhibitor; Kazal type 1 3.38 

101559 M32053 Human H19 RNA gene, complete cds 3.37 

116389 AA599011 troponin T1; skeletal; slow 3.36 



EB.cefls, LNCaP.cells, MCF7 
HMEC, EB.ceOs, DU145.cells 
293T_ceDs, EB.cells, DU145_cells 
293T_ce&, DU145.oeQs, EB.cells 
OVCAf^oelb, Lu.SC.H345, HMEC (total RNA) 
DU145_ceits, PC3.cefls, Fibroblasts 2 

LNCaP.ceOs, DU145_ceCs, OVCAR_ceDs 
Lu_SC_H345 ( UlSC.H69, PRSCJog 
DU145_ceDs ( MCF7, LNCaP.cells 
OVCARjaBs, Fibroblasts 2, Lu_SC_H69 
OVCAfLcelb. Lu_SC_H345, PRSC.con 
DU145.C8II8, OVCAR_ceHs, Lu.SC.H345 
HT29_cete, MB231_ceDs, BT474_cells 
Lu_SC.H69, OVGARjnfe, EB.cefls 
HS578T_ceBs, EB_cells, PRSC_con 
293T_celis, MB-MDA-435S, A549.cet!s 
PC3.ce0s, A549.ceil8, OVCARj»Ils 
CALUS.cells, EB_celIs, Lu.SC.H69 
LuJU)_H23, EB.cells, Lu.SC_H69 
293T_celIs, DU145_ceDs, EB.cells 
PC3_cefls, HS578T.cells, DU145_cefls 
Caco2, EB.cells, CALU6_cells 
EB_cefls, LNCaP.cefls, Caco2 
OVCAR_cens, DU145.ceBs, EB.cells 
OVCAR_ceHs, OVCAR_ceIls, LNCaP.ceBs 
EB.cells, LNCaP.cells, OVCAR.cells 
HT29_ceBs, BT474_cefe, Lu.SC.H69 
DU145_cefl3 ( OVCAR.cells, LNCaP.cells 
DU145_cefls, MB-MDA453, Lu_SC_H69 
DU145_cefls, DU145.ceBs, Lu_SC_H345 
MB-MDA453, CALU6_ceIls, EB.cells 
DU145.C8US, Fibroblasts 2, Caco2 
LNCaP.cells, DU145_ceIls, EB.ceOs 
OVCAR_cells ( EB.cells, Lu_SC_H345 
EBLceDs, OVCAR_ceDs, 293T.celb 
293T_ce!ls ( DU145_ceus, EB.cells 
EB.cells, Caco2, DU145_cetls 
EB.cells, LNCaP.cells, 293T_ceDs 
HT29_ce0s, Lu.SQ.H520. Lu_SQ_H520 
Rbroblasis2, HS578T_celis, MB231.celis 
OVCAFLceHs, LNCaP.cefls, Lu.SC.H345 
OVCAR_cefls, A549.ceiis, 293T.ceIb 
LU.SCJJ345, Caco2, OVCAR_cells 
EB_celis, HS578T_ceIis, LNCaP.cells 
Lu.SC.H345, OVCAR.cells, DU145_ceHs 
Lu.SC.H69, UJU5JH23, Lu.SC.H345 
293T_celis, MB231.ceiis, Fibroblasts 2 
DU145_ceHs r HS578T_ceHs ( CALU6_cefc 
BT474 cells, MB231 .cells, HT29_cetls 
Lu.SQ.H520, MB-MDA-435S, DU145.cells 
DU145.cete, OVCAR^cells, LNCaP.cells 
Lu.SC.H345, OVCAR.cells, PRSC.con 
PC3.ceDs, DU145.ce!is, HS578T_cefls 
LuJU)_H23, Lu.SQ.H520, tu_AD.358 
DU145_celb, EB.cells, A549_ceDs 
PC3.cells, EB.ceBs, MB231.cells 
OVCARjaQs, DU145_ceBs, PC3_cete 
MB231_ceDs, CALU6.ceHs, BT474_cells 
MB231_celIs, BT474.ceiis, Lu.SC.H345 
DU145_celis, Lu_SC.H345, OVCAR_celIs 
HT29.celIs, OVCAR_cells ( HMEC 
MB-MDA435S, HT29.ceHs, HT29_ce!ls 
LNCaP.cells, DU145_cells, Lu_SC_H345 
EB.cefls, LNCaP.cells, DU145.ceils 
LNCaP.cells, CALU6_cefls, 293T_ceBs 
HT29 oeQs, A549.ceHs, Fibroblasts 2 
EB.cefls, PC3.ceHs, Lu_SC_H345 
DU145.ceHs, Lu.SC.H345, EB.cells 
EB cells, LNCaP.cells, DU145.ceDs 
PC3.cells, MCF7, DU145.cells 
Caco2, EB.cefls, Lu.SC.K69 
Lu.SC.H69, MCF7, OVCARjalls 
DU145.ceBs, LNCaP.cells, OVCAR.cefls 
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130641 AA182001 Hs.17155 ESTs 3.36 

103362 AA214615 Hs.194348 ESTs 3.33 

106278 AA432292 Hs.23388 ESTs; Moderately simBar to till ALU SUB 3.33 

127241 AA321849 Hs.248340 H sapiens mRNA; cDNA DKFZp564J2116 (from 

133339 N64588 Hs.71252 ESTs 3.32 

113260 T64896 Hs.237992 ESTs 3.32 

133349 N75791 Hs.7153 L-3-hydraxyacyl-Coen2yme A dehydrogenase 3.31 

107149 AA621159 Hs23284 ESTs 329 

133195 AA350744 Hs.181409 KIAA1007 protein 329 

111302 N73838 Hs, 15049 ESTs 329 

106414 AA447971 Hs28827 ESTs 328 

121768 AA421561 Hs251664 insuWke growth factor 2 (somatomedi 328 

117176 H98670 Hs.49753 ESTs; Weakty similar to hypothetical pro 3.28 

131320 AA171948 Hs.145696 splicing factor (CC 1.3) 3.28 

100700 HG3227-H Guanine Nucteotide-Bindlng Protein Hsrl 3.27 

134275 AA132328 Hs.3688 arid-mdurible phosphoprotein 326 

117667 N39214 H&44708 Ser-Thr protein kinase related to the my 326 

124889 R78604 Hs.101570 ESTs 325 

126631 W95117 Hs.193337 ESTs 325 

105643 AA282069 Hs. 173802 KIAA0603 gene product 324 

132718 AA056731 H&554 Sjogren syndrome antigen A2 (6GkD; ribon 324 

116417 AA609309 Hs239302 ESTs; Weakly similar to III! ALU SUBFAMi 324 

108039 AA041341 Hs.46670 ESTs 324 

114116 238496 Hs. 103283 K1AA0594 protein 3.23 

124514 N58045 Hs.142737 ESTs ' 3.22 

110802 N26651 Hs252748 ESTs 322 

106920 AA490899 Hs24462 ESTs 322 

123523 AA608588 Hs.193634 ESTs 321 

131564 AA491465 Hs28792 ESTs 32 

119423 T99544 Hs.173734 ESTs; WeaWy similar to Hit ALU CLASS B 32 

128736 F03934 Hs.104607 ESTs 3.19 

101511 M27826 Hs238380 Human endogenous retroviral protease mRN 3.18 

114509 AA043551 Hs.95249 ESTs 3.18 

124196 H52617 Hs.144167 ESTs 3.17 

129095 L12350 Hs.108623 thrombospondin 2 3.17 

116457 AA621367 Hs.119683 ESTs 3.17 

117040 H89112 yw25e5.s1 Morton Fetal Cochlea H sapiens 3.16 

129112 N32521 Hs.108738 ESTs 3.16 

130418 J03242 Hs251664 Insulin-Eke growth factor 2 (somatomedi 3.16 

131199 R80048 Hs234433 ESTs; WeaWy similar to transporter prot 3.15 

110357 H41529 Hs33549 ESTs; Highly similar to sulfonylurea rec 3.15 

130068 AA608903 Hs.1Q6220 K1AA0336 gene product 3.15 

127423 T47546 Hs.1 19252 tumor protein; transtettonal^controHe 3.15 

105028 AA126719 Hs.25282 ESTs 3.14 

102349 U37547 Hs.75263 apoptosis inhibitor 1 3.14 

105126 AA157814 Hs.36288 ESTs 3.13 

115465 AA286941 Hs.43691 ESTs 3.12 

133246 AA086452 Hs.68731 triadin 3.12 

122698 AA456112 Hs.99410 ESTs 3.12 

123553 AA608841 Hs. 111977 ESTs 3.12 

133437 R57419 Hs.7370 ESTs 3.11 

104956 AA074880 H&120975 ESTs; WeaJdy simBar to hypothetical pro 3.11 

116314 AA490588 Hs.43118 ESTs 3.11 

120562 AA280036 Hs.173912 eukaryotic translation initiation factor 3.11 

108770 AA127845 Hs.71027 EST 3.11 

129791 F02778 Hs.1738S7 K1AA0876 protein 3.1 

115783 AA424487 Hs.72289 ESTs; Weakly similar to UV-1 protein (H 3.09 

107630 AA007218 Hs.60178 ESTs 3.07 

124339 H99093 Hs.6179 H sapiens mRNA; cDNA DKFZp586K2322 (from 

122314 AA442257 Hs.192076 ESTs 3.07 

104589 R79299 Hs241160 ESTs; Moderately similar to HII ALU SUB 3.07 

115667 AA410508 Hs.183765 ESTs; Moderately smlr to ORF derived frm 3.06 

123796 AA620390 Hs247444 ESTs 3.06 

106483 AA451676 H&3Q299 IGF-JI mRNA-bindlng protein 2 3.06 

133318 AA256168 Hs.70838 ESTs 3.05 

117244 N20979 Hs.1757 L1 cell adhesion molecule (hydrocephalus 

thumbs) syndrome; spastic paraplegia 1) 3.05 

130797 AA430050 Hs.180948 KIAA0729 protein 3.05 

128959 D79791 Hs.107381 ESTs; WeaWy simBar to F38A5.1 [Cetega 3.05 

120481 AA252703 Hs.191754 ESTs 3.04 

126649 AA856990 Hs.125056 ESTs 3.03 

106970 AA504835 Hs24252 ESTs 3.03 

126488 N34935 Hs25633 ESTs; Highly similar to ARF GTPase-activ 3.03 

119498 W37226 Hs.55573 ESTs 3.01 

129967 H99653 Hs.138618 ESTs 3.01 

130698 AA037357 Hs.188212 ESTs 3.01 



DU145.cells, MB-MDA-435S, HS578T_cells 
HT29.oells ( Fibroblasts 2, BT474.cells 
EB.cells, Fibroblasts 2, BT474_ceBs 
3.32 LNCaP.ceils, DU145.cells, EB.cells 
DU145.ceHs, EB.ceBs, Caco2 
UuSCJC45. LNCaP.celis, Lu.SC.H69 
Caco2, EB.cells, OVCAR.cells 
HS578T.ce)ls, DU145.celis, PRSC.con 
EB.cells, UbAD_H23, Lu_AD.358 
DU145_ceB3, EB.cells, HS578T.ceils 
A549_cetts, OVCAR^ceBs, PC3_ceits 
Caco2, PRSC.con, PRSCJog 
PRSC_tog, CALU6_cefls, OVCARjslls 
EB.cells, LNCaP.celis, DU145_cells 
EB.cells, RPWEJ, UlAD_H23 
EB.cells, DU145.cells, LNCaP.celis 
LNCaP.ceils, DU145_cells, M&MDA-453 
LILAD.H23, Lu.SC.H69, UcSC_H345 
Lu.SC.H345, OVCAfLcells, Lu_SC_HB9 
Caco2, EB.cells, 293T_celte 
CALU6.ceDs, OVCAR.ceits, A549.cefls 
A549_cells, CALU6.cefc, 293T_ce!ls 
293T.cbDs, EB.cells, Caco2 
DU145_cells, OVCAR.ceib, EB.cells 
EB cells, Caco2, Lu_SQJ^520 
LNCaP.ceBs, MB-MDA-435s, N&MDA-453 
DU145_ceHs, EB.cells, OVCAR.cells 
DU145_ceHs, LNCaP.ceils, OVCAR_ceils 
HS578T_cells, HMEC (total RNA), HMEC 
EB.ceDs, DU145_ceHs, Caco2 
PC3 ceils, Lu.SO.H520, Lu.SC.H69 
PC3_celIs, DU145_celis, Lu.SQ_H520 
EB.ceSs, Lu_SC_H345, DUl45_ceib 
BT474_ceils, MB231_ceils, HMEC 
Fibroblasts 2, PRSC.con, PRSCJog 
293T.cefls, Lu.SC.H345, CALU6_cells 
OVCARjalls, 293T_ceDs. EB.cells 
EB.cefls, Fforablasts 2, MB231_ceils 
Caco2, PRSC.con, PRSCJog 
PC3.ceDs, EB_celis, OVCAR.celis 
Lu.SC.H345, PRSC.con, Lu_AD.H23 
OVCAR_cefls, CALU6.ceQs, HS578T_celIs 
EB.cells, PRSC.con, LNCaP.celJs 
LNCaP cells, PC3_ceiis, EB.ceDs 
DU145.CB0S, HS578T.ceDs, LNCaP.ceils 
EB.cells, HS578T.ceUs, LNCaP.ceils 
EB cells, DU145.ceHs, 293T.cefc 
Lu.S(m520, UlAD_H23, PRSCJog 
DU145.ceIls, OVCAR.cells, A549_ce]ls 
EB.cells, Caoo2, DU145_cefls 
HS578Tjcells, 293T_ceDs, Caoo2 
OVCARjalls, Fibroblasts I Caco2 
EB cells, MB-MDA-435s, HT29 ceDs 
LNCaP.ceils, DU145_oeIls, EB.cefis 
Lu.LC.H460, Lu.SC.H345, LuJU).358 
Lu_SC_H345, Lu_SC_H69, PRSCJog 
LitAD_358, EB.ceDs, PC3.celis 
Lu.SC.H345, CALU6.cells, Lu.SC.H69 
3.07 293T_cells, MB-MDA453, Caco2 
293T.cells, LNCaP.ceBs. PC3_ceiis 
293T.cells, DU145.ceHs, EB.cells 
Caco2, EB.cells, MB231_cells 
Lu.SC.H345, LNCaP.celis, DU145.ceBs 
OVCARjalls, HMEC (total RNA), HMEC 
OVCAR.celIs, UJCaP.ceib, 293T_ceils 

MB231.ceBs, MCF7, CALU6_ce«s 
EB cells, DU145_ceils, DU145_cells 
LNCaP.celis, HS578T.cells, Lu.SQ.H520 
EB ceCs, FforoWasts2, PRSC.con 
OVCAFLceiis, LNCaP.ceils, 293T_ceiis 
EB ceils, OVCARjslls, 293T cells 
LuJiD.358, MCF7, MB231.cells 
293T cells, HS578T.celis, CALU6 cells 
Lu.SC.H345, Lu_SC^69. PRSCJog 
OVCAR.cells, LNCaP.ceBs, DU145.celb 
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111018 
1231% 
133229 
130405 
107881 



N54067 
AM 89250 
AA203433 



105479 
115550 
130166 
123355 
109546 
129001 
102259 
105563 
131859 
114533 
110220 
124917 
127111 
134882 
121788 
128530 
128435 
113782 
127569 
109642 

114615 
126808 
113947 
129455 
107772 
127159 
124792 
109751 



106637 
132164 



104818 
126050 

116695 
135204 
134946 
114975 
113792 
102322 
125642 
100286 
105876 
125262 
114419 
130639 
130972 
126906 
121807 

105474 
122348 



AA025567 

D59570 

AA255546 

AA393812 

AA35Q690 

AA504773 

F01449 

AA448946 

U28369 

AA278907 

M90657 

AA053401 

H23543 

R91241 

AA805726 

M73762 

AA423968 

AA504343 

AI301201 

W15580 

AA588536 

F04465 

AA083812 

AA086320 

W84768 

W27301 

AA018587 

AA284097 

R44357 

F10210 

AA481403 

AA459961 

U84573 

AAS05327 

AA034947 

H27257 

1=09780 

AA421146 

AA406534 

AA250850 

W35212 

U34S62 

AI096849 

D43951 

AA400184 

W88755 

AA011448 

D59711 

AA370302 



135143 
106711 



132139 
114484 
124620 
100403 
-129795 
128258 
102662 
132232 
106111 
123963 
122783 
112788 



AA424507 

AA255440 

AA443695 

AA521186 

AA102644 

AA464741 

L32832 

AA213410 

AA034378 

N74051 



HS.3628 

Hs.59403 

Hs.6834 

Hs.155396 

Hs.61273 

Hs.17132 

Hs.23467 

Hs.50575 

Hs.151411 

Hs.160657 

Hs.26954 

Hs.107812 

Hs.82222 

Hs^4549 

Hs.3337 

Hs.177526 

Hs.27090 

Hs.75470 

Hs.220509 

Hs.90638 

Hs.178113 

Hs.183475 

Hs.147112 

Hs.15342 

Hs.191783 

Hs.22394 

Hs.159456 

Hs.141742 

Hs.187991 

Hs.40515 

Hs.237955 

Hs.132784 

Hs.6679 

Hs.107213 

Hs.250824 

Hs.41270 

Hs.24831 
Hs.75860 

Hs.66124 

Hs.183418 

Hs.193053 

Hs.13944 

Hs.17691 

Hs.54473 

Hs^5274 

Hs.153834 

Hjl24656 

Hs.108514 

Hs.106532 

Hs.17132 

Hs.21739 

Hs.168069 

Hs.247478 



mltogen-activated protein kinase kinase 3.01 

serine palmitoyltransferase; subunit II 3 

WAA1014 protein 3 

nuclear factor (erythroki-derived 2)-lik 3 

H sapiens chromosome 19; cosmtf R32611 3 

ESTs 3 

ESTs 2,99 

ESTs; Moderately similar to III! ALU SUB 299 

K1AA0916 protein 2.98 

ESTs 2.98 

ESTs 2.97 

ESTs; Weakly similar to profine-rich pro 2.97 

sema domain; Immunoglobulin domain (Ig); 2.97 

ESTs 296 

transmembrane 4 superfamfiy member 1 296 

ESTs 296 

ESTs 295 

hypothetical protein; expressed In osteo 295 

ESTs 294 

ESTs 294 

ESTs; Moderately similar to kinesin like 294 

H sapiens clone 25061 mRNA seq 294 

ESTs 293 

phosphate cytidylyltransferase 1 ; choiin 2.93 

ESTs 293 
ESTs; Weakly similar to weak similarity 
protein US)1 [Oelegans] 
DKF2P566F123 protein 
zn52d12s1 Strategene muscle 937209 H sa 
ESTs 

DKF2P564A122 protein 
ESTs; Weakly similar to III! ALU SUBFAMl 
RAB7; member RAS oncogene family 

ESTs; Weakly similar to cDNA EST EMBLT0 291 
H sapiens mRNA; cONA DKFZp586A0424 (from 

ESTs; HighlyslmiiartoNY-RErW antigen 2.9 
ESTs 

procollagen-lysine; 2-oxoglutarate 5-dio 
ESTs 
ESTs 

hydroxyacyl-Coenzyme A dehydrogenase/^ 



2.92 
292 
292 
292 
291 
291 
291 



29 
29 
29 
29 



-Coenzyme A hydratase {trifunctional pro 289 

EST 289 

ceil division cycle 2-0ke 1 (PITSLRE pr 289 

EST s; Weakly srmOar to h'rwi [H.sapjens] 288 

adrenergic; beta; receptor kinase 2 288 

ESTs; Weakry similar to env protein pis 2.88 

cardrac*specffic homeo box 2.88 

ESTs; Moderately similar to putative sev 2.88 
Human mRNA for K1AA0099 gene; complete c288 

WAA0907 protein 288 

ESTs; Highly similar to Trio [H^apiens] 288 

ESTs; Weakly similar to transpose* LRE2 268 

ESTs 287 
H sapiens mRNA; cDNA DKFZp586M518 (from 

ESTs; Highly similar to CALCfLfM-BtNDING 287 
H sapiens Mut S homolog 5 gene; partial 
1C7; LST-1; lymphotoxin beta; tumor necr 
F-box protein FBL11 
ESTs 
ESTs 

K1AA1096 protein 



287 
287 
287 
286 
286 



Hs.219614 
Hs.231476 I 
Hs.94217 
Hs.69559 

Hs. 143187 Human DNA from chromosome 19-specitlc co 286 
Hs.101842 ;~ 
Hs.111554 , 
Hs.252351 
Hs.194092 



AA448627 
T70214 
U70321 
AA252030 
AA421638 
C13961 
AA459895 
R96586 Hs.163630 



Hs.125163 

Hs.183548 

Hs.130227 

Hs.42640 

Hs.6451 

HsJ!10115 



AT-Wndlng transcrfptton factor 1 2.85 

AIP«ribosylation factor-fike 7 285 

HERV4UTR-assoctating2 2.85 

ESTs; Weakly simBar to 1111 ALU SUBFAMl 2.85 

H sapiens mRNA for LIM domain, partial c 2.84 

ESTs; Weakly simfiar to HI! ALU SUBFAMl 284 

ESTs 284 

tumor necrosis factor receptor superfami 284 

ESTs 284 

ESTs 2.83 

EST 283 

ESTs 283 

ESTs 282 



PC3 cells, Caco2, Fibroblasts 2 
Lu.SC.H345, BT474_cells, LU.SCJ169 
OVCAR_cells, 293T.cells, EB.cells 
PRSC.con, EB.cells, DU145_ce0s 
LILSCLH520, MCF7, UUDJ5B 
EB.cells. A549.cells, HS578T.cells 
LU.SCJW5, PC3_oeIis ( OVCARjslIs 
EB.cells, Lu.SCJi69, Fibroblasts 2 
LNCaP.ceils, EB_cefls, 293T_celIs 
PRSC.con, PRSC_tog, PRSCJog 
Lu„SCJC45, HT29_cells 1 BT474_cefls 
EB.cells, LuJ\D_H23, LuJU>.358 
EB.cells, MB231_ceHs, OVCARjalls 
EB.cells, DU145.cells, 293T.cells 
A549_ceHs, PC3.celIs, DU145_celJs 
293T_ceils, Lu_LCJM60 t PC3.cells 
PRSCJog, Lu.SC.H345, MB231.ce0s 
LiLSCjmS, Lu_SC_H69, PRSCtog 
HS578T_cel]s, 293T_cefls, 293T_ceJis 
EB.ceBs, MB-MDA-453, Fibroblasts 2 
HT29.ceBs, CALU6_ceDs, HMEC 
DU145.cefls, Lu_$C_H345, Caco2 
EB.cells, Lu_SCLH520, PRSC.con 
EB.ceDs, Luj*DH23, PRSCJog 
EB.ceDs, HS578T_ceBs, UlADJSB 

PC3_ceHs, EB.cells, OVCARjsHs 
A549 ceils, HS578T_cefls, PRSC.con 
Lu.SC.H69, Lu.SC.H345, EB.cells 
DU145_cefls, Fibroblasts 2 MCF7 
OVCAR_ceils f DU145.ce»s, CALU6.celis 
OVCAR_ceIl3, EB.cells, PC3_ce0s 
293T_cefls, OVCAR.ceOs, PC3_ce!ls 
DU145_ceDs, DU145.ceBs, CALU6.celIs 
251 EB.cells, Lu_SC_H69, 293T.ceIls 
CALU6.oells, EB.cells, OVCAR_ceBs 
EBjceBs, Caco2, MB-MDA-435s 
DU145.ceDs, HS578T.celIs, A549.ceDs 
MCF7, HMEC (total RNA), 293T_ceIIs 
EB.ceBs, Uj_LC.H460, 293T_ceHs 

LNCaP.ceBs, DU145_ceIis, OVCAR.ce0s 
CALU6.ceDs, 293T.ce8s, 293T_ceHs 
PC3_cefls, EB.cells, LNCaP.ceils 
EB.cells, LNCaP.ceDs, Caco2 
EB.cells, EB cells, EB.cells 
MB-MDA-435s, Lu_SC.H69, CAUU6.ceIls 
293T.ceIls, HT29_ceDs l UtADJi23 
PC3_ceBs, CALU6_celis, 293T.ceDs 
293T ceOs, LNCaP.ceils, EB.cells 
OVCAR.cells, DU145_celIs, 293T.ceiis 
DU145 cells, HS578T_ce0s, MB231.cells 
EB.ceDs, Lu.AD.H23, Fibroblasts 2 
EB.cells, A549_cells, OVCAH_cells 
287 293T.cells t A549_cete, Lu_LC.H460 
Lu_SC_H345, Lu.SCj.H69, LNCaP.ceils 

LU.SCJG9, HT29_cells, RPWEJ 
LuJULH23, Caco2, EB.cells 
HT29_ce0s, Lu_SC.H69, BT474_ceiIs 
MB-MDA-453, OVCARjsflS, Lu_SC.H69 
PC3.cells, EB.cells, 293T_celis 
EB.cells, Lu_AD_H23, Lu_LC_H460 
LNCaP.ceils, Caco2, EB.cells 
A549 cefis, HS578T.oeBs, Caco2 
PC3 cells, Lu.SQ.H520, MB231.ceBs 
Lu_SC_H345, MB231.cefls, Fibroblasts 2 
Lu_AD_358, UlAD.358, MB231.ceDs 
Lu.SC.H345, OVCAFLcelb, PC3.oells 
DU145.ceDs, DUWS.ceDs, OVCARjjelis 
EB cells, LuJU)_H23, Fibroblasts 2 
EB cells, OVCAR.ceDs, LU.SCJ1345 
EB ceOs, Lu.LC H460, OVCAR_cetls 
DU145.ceils, LNCaP.ceils, LU.SCJI345 
EB.cells, MCF7, Lu.SCJi69 
DU145.ceas, Lu.SCJ<345, EB.cells 
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12)823 
100378 
114677 
108085 
104938 
128743 
124314 
134227 
122922 

126096 
129295 
116155 
105911 
119232 
131168 
106048 



AA347546 

D80009 

AA114163 

AA0456Q2 

AA064627 

AA237013 

H94877 



106120 
100643 
128500 



127064 
132989 



119579 
100387 
114744 
129092 
125360 
107874 
114086 
116180 
126027 
116339 
105387 
111359 
106680 
118598 
107913 
134315 
135233 
112932 
119053 

131206 



131060 
132135 
120835 
113815 
133234 
126819 
125198 
108394 

134456 
111720 
114617 
127787 
101437 
133761 



125191 
116238 
124770 
101764 



AA47626B 

H42968 

AA424762 

AA460957 

AA401809 

T03475 

AA4B2007 

AA416697 

N21626 

D86974 

AA423608 

HG275541 

U60521 

R44789 

Z43709 

AA480074 

AA135606 

W42429 

D83777 

AA135407 

AA011243 

AA677978 

AA025305 

Z38266 

AA463902 

M61982 

AA496257 

AA236951 

N91273 

AA461458 

N69136 

AAQ27161 

AA136269 

AA127463 

T15470 

R11501 

AA044078 

AA063642 

AA160890 

N69101 

AA348446 

W45311 

T90092 

AA305536 

W69474 

AA075144 

X59405 

R23739 

AA084148 

AA731764 

M20681 

AA477223 

AA399574 

W67257 

AA479362 

R40555 

M80563 



Hs.185780 ESTs 2.82 

Hs.10848 KIAA0187 gene product 2.82 

Hs.188877 ESTs 2.81 

Hs.62863 ESTs; Moderately simitar to serine/tfveo 2.81 

Hs.18341 ESTs; Htgroy similar to CGI-72 protein [ 2.81 

Hs.2730 heterogeneous nudearrtaiicteoprotein 2.8 

Hs^15766 GTP-blnding protein 2.8 

Hs.80338 WAA0164 gene product £8 
zw44h1.s1 Soare9„totaLfetus.Nb2HF8.9wH 

contains Atu repetitive eiementcontaln 2.79 

paired mesoderm homeo box 1 2.78 

SEC7homolog 2.78 

DEAD/H (Asp£lu41a-Asp/Hls) box poJypep 2.78 

ESTs £77 

EST 2.77 

ESTs; Weakly simfiar to homology with Is 2.77 

ESTs £76 

ESTs 2.76 

KIAA0220 protein 2.76 

RNAhelicase-related protein 2.76 

T-Plastn £75 

caspase 9; apoptosis-reiated cysteine pr 2.75 

ESTs; Weakly simitar to rostral oerebetl 2.75 



Hs.155606 
Hs.1 10121 
Hs.76053 
Hs.189910 
Hs^58624 
Hs.23788 
Hs.1 5330 
Hs.102406 
Hs.110613 
Hs.8765 

Hs.100641 
Hs.1 19486 

Hs.394 
Hs.189384 
Hs.150607 
Hs.75137 
Hs.252351 
Hs.63525 
Hs.189741 
HSL25218 
Hs.12770 
Hs.94964 

Hs.72165 
Hs.108636 
Hs.27179 
HS.247B9 
Hs.214343 
Hs.59523 
Hs.81648 
Ks.9683 
Hs.189810 



Hs.24210 

Hs.22564 
Hs.40730 
Ha.96906 
Hs.14756 
Hs.6853 
Hs.161489 
HSJ225550 



Hs.83532 



Hs.110659 ESTs 



HSC1 JA091 normalized infant brain cDNA H £75 

adrenomedulSn 2.75 

ESTs; Weakly similar to iili ALU SUBFAMI 2.75 

ESTs £74 

KIAA0193 gene product £74 

HKV^LTR-assodafing2 £74 

poly(rCHjinding protein 2 £74 

ESTs £74 

ESTs; WeaWy similar to reverse transcri £74 
H sapiens PAC ctone DJ0777O23 from 7p14- £74 

ESTs £73 

ESTs 2.73 

ESTs; Weakly simfiar to R28984J [H^apl £73 

chromosome 1 open reading frame 9 2.72 

ESTs £72 

ESTs £72 

ESTs 2.72 

EST s; Highly similar to G1 TO S PHASE TR £71 

ESTs; Weakly similar to S164 [Rsapiens] 2.71 

proteWtinase; interferon-inducibte dou £71 

ESTs £7 
yf28fl.s1 Scares fetal liver spleen 1 NFL 

contains Aiu repetitive element;, mRNA £7 

ESTs £7 

ESTs; Highly similar to (defline not ava 2.7 

myosin VI £7 

ESTs £7 

ESTs 2.7 

ESTs £7 

ESTs; Weakly similar to HI! ALU SUBFAMI £69 

ESTs 2.69 

ESTs £69 
zm86$.s1 Stratagene ovarian cancer (#93 
gb:X1664 TRANSLATtONALLY CONTROLLED TUM 
membrane cotactor protein (CD46; trophob 
K1AA1078 protein 



£68 
2.68 
£68 
£68 
£68 
2.68 
£68 
£67 
2.67 



130897 AA063428 

133303 H61046 

124724 R12405 

123697 AA609601 

111548 R09170 

107005 AA598679 

105569 AA278399 



ESTs; Weakly similar to Hi! ALU CLASS C 

Hs.7594 solute carrier family 2 {facEtated gtu 

Hs.75922 brain protein 13 

Hs.19086 ESTs 

Hs.138871 ESTs; Weakiy similar to tilt ALU CLASS B 

Hs.47144 DKFZP585N0819 protein 

Hs.120429 ESTs 

Hs.81256 S100 calcium-binding protein A4 (caicium 

murine placental homolog) £67 

Hs.21022 adaptor-related protein complex 3; beta 2.67 

Hs.237352 EST 2.66 

Hs.1 12423 H sapiens mRNA; cDNA DKFZp586l1420 (from 

Hs.221224 ESTs 2.66 

Hs.258707 ESTs £66 

Hs.194215 ESTs 2.66 

Hs.20596 ESTs 2.65 



HT29_ceils, HMEC (total RNA), BT474_ce»3 
Caco£ PC3_ceUs, OVCAFLcefls 
DU145_cells, MCF7, EB.ceiis 
EB.ceiis, LuJU)_H23, HT29.ceils 
PC3_ceils, HS578T.ce0s, OVCArVcelb 
OVCAR_caiis, UCaP.cefis, Caco2 
LNCaPjalls, DU145.cens, Caco2 
LNCaPjslls, A549_ce!ls, EB.ceiis 

Lu.SC.H345, OVCAR_ceiis, Lu.SC.H69 
UlAD_H23, Lu_SC_H69, LUJLC_H460 
LuJ\D_H23, EB.ceiis, Lu.SC_H345 
EB.ceiis, OVCAR.ceIls, 293Lcells 
293T_ceBs, HS576T_ceils, DU145_cefls 
EB.ceiis, LlAD.H23, UlAD_358 
EB.cetts. UU-CJ1460, MCF7 
OVCARjalfe, Lu_SC_H345, 293T_ceUs 
MCF7, MB-MDA-453, CALUB.cells 
DU145_ce[Is, HT29_cefls, Lu.SC.H69 
OVCAR_ceBs, EB.ceiis, 293T.ce0s 
293T.ceDs, PC3.ceHs, HS578T_cells 
UlAD.358, Lu.SC.H69, Lu.SC.H345 
Lu.SC.H69, Lu.SC.H345, BT474.ceiis 
08002, A549.cefc, HT29_ceiis 
EB.ceiis, OVCAR.ce0s, DU145_oelis 
OVCAFLcefis, LNCaP.cells, DU145_cells 
293Tjsils, DU145_ce0s, PC3_ceils 
CALU5_cefe, DU145.cefls, Caco2 
PC3_celIs f Lu__SQ_H520, RPWEJ 
EB.ceiis, MCF7, DU145.cells 
UlAD_35B, LUJUD.358, PRSCJog 
Lu.SC.H345, Lu.LC.H460, HT29_oells 
EB.ceiis, WCaP.ceBs, BT474_ce«s 
Lu.SC_H69, PRSC.con, LuJtf)_H23 
LNCaP.cells, DU145.cells. A549_ceDs 
EB.ceiis, DU145.ceKs, OVCAR.ceJ!s 
PC3_ceJls, EB.ceiis, Caco2 
EB.ceiis, LNCaP.cells, 293T_cells 
PC3_cells, Lu.SC.H345, Caco2 
MB-MDA-453, 293T_csBs, BT474_cefls 
EB.ceiis. MCF7, Lu_SC_H345 
EB.ceiis, DU145_ceDs, HMEC 
EB.ceiis, OVCARjceDs, Caco2 
293T.ceHs, UlAD_H23. PC3.celis 

Lu.SC.H345, UlSC_H69, DU145_cells 
Caco2, Lu.SC.H345, HS578T.ceiis 
LNCaP.cells, Lu.SC.H345, Lu.SC.H69 
LNCaP.cells, MCF7, HT29.cells 
EB.ceiis, 293T_ce0s, OVCAFLcefls 
Fibroblasts 2, CALU6.ceQs, RPWE.2 
EB.ceiis, PC3 cells, DU145 ceOs 
Lu.SC.H345, 0VCAR_ceilsr0U145.cells 
EB.cefls, DU145_ceJts, Caco2 
Lu.SC.H345, ULAD.H23, LuJ\D_H23 

£69 HMEC, HMEC (total RNA), Fibrobtests 2 
EB.ceiis, LNCaP.cells, DU145_cefis 
PC3.celb, HMEC (total RNA), OVCAR_ceiis 
DU145_cells, LNCaP.cells, OVCAFLcefls 
HT29.cefls, Lu_SC.H345, MB231.cells 
Caco2, UJLCJH460. Fibroblasls2 
EB.ceiis, Lu.AD.H23, Lu.SC_H345 
PC3.cefls, MCF7, MB231.ceils 
OVCAR_ceIis, DU145.ceBs, LNCaP.cells 
OVCAR.celis, DU145_ceiis, LNCaP.cells 
Lu_AD.H23, Lu.SC.H69, PRSC.con 

A549 cells, MB231.ceiis, OVCAR_ce»s 
EB.ceBs, LuJU)_H23, HMEC 
Lu.SC_H345, Lu.SC.H69, PRSC.con 
£66 Lu.SC.H345, BT474.ceHs, OVCAR_ceils 
OVCAR_cefls, 293T.cells, Lu_SC_H69 
293T ceils, CALU6_celts, A549_celis 
ULSC.H345, OVCAR.ceiis, Lu.AD.H23 
MCF7, HT29.cells, BT474.cells 
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132687 AB0Q23O1 Hs54985 WAA0303 protein 165 

104105 AA422123 Hs.42457 ESTs 2.65 

121335 AA404418 Hs.144953 ESTs 2.65 

124853 R61693 Hs.172330 ESTs; Weakly similar to WbkotkAWrich 2.64 

124253 H89742 Hs.102201 ESTs 2.64 

123044 AA481549 Hs.165694 ESTs 2.64 

12535 AA608852 Hs.112603 EST 164 

131397 AB002336 Hs.26395 erythrocyte membrane protein band 4.1-11 2.64 

130175 X75593 Hs.151536 RAB13; member RAS oncogene family 164 

127507 AI188445 Hs.152618 ESTs 2.63 

105377 AA236702 Hs^4371 ESTs 2.63 

114671 AA1 12679 Hs.252291 ESTs; Weakly similar to !!!! ALU SUBFAMI 2.63 

133726 W19983 Hs.75761 SFRS protein kinase 1 2.63 
132380 H68018 yr76h05.r1 Soaresfeta) liver spleen 1NF 

IMAGED 1 1257 5 1 , mRNA seq. 2.62 

127986 AI37041 8 Hs, 192050 ESTs; Weakly simitar to till ALU CLASS A 2.62 

116208 AA476333 Hs.42532 ESTs 2.61 

130946 AA059455 Hs.21490 KIAA0438 gene product 2.6 

106687 AA463234 Hs.119387 WAA0792 gene product 259 

101551 M31606 Hs.196177 phosphorytase kinase; gamma 2 (testis) 2.59 

114479 AA032084 Hs.124841 ESTs;Moderaterysfmilajtotrarisfbn^ 259 

111863 R37495 Hs.23578 ESTs 159 

129018 AA029973 Hs.107979 small membrane protein 1 2.59 

107058 AA600357 r&239489 T1A1 cytotoxic granule-associated RNA-bi 258 

. 126175 AA056181 Hs.17311 DKFZP434N161 protein 2.58 

131979 D52154 Hs.172458 iduronate 2-sulfatase (Hunter syndrome) 2.58 

126122 H80181 ESTs 258 

106961 AA504110 Hs.18063 ESTs 2.58 

114730 AA133527 Hs.126925 ESTs; Weakly similar to The KIAA0138 gen Z58 

117342 N24020 Hs.132913 ESTs 2.58 

131622 AA424813 H&29692 ESTs 257 

104904 AA055560 Hs.13179 ESTs; Moderately similar to ALU SUB 157 

117359 N24848 Hs.114062 ESTs; Weakly similar to T1 5B7.2 [Cetega 257 

123331 AA497013 Hs.188740 ESTs; Weakly similar to ill] ALU SUBFAMI ' 157 
125324 R07785 yf15o06j1 Scares fetal Bver spleen 1NF 

contains Alu repetitive elarnentjcontain 2.57 

129813 T33462 Hs.12600 ESTs 2.57 

100265 D38521 Hs.75935 WAA0077 protein 257 

134890 T40902 Hs.90786 ATP-binding cassette; sub-family C (CFTR 2.57 

133582 AA421874 Hs.75087 Fas-activated serine/threonine kinase 2.56 

135011 H73161 Hs.92991 ESTs; Weakly similar to C13F10.4 [Celeg 2.56 

107226 D58185 Hs.21945 ESTs 156 



126042 H62441 Hs. 157082 H sapiens PAC done DJ0988G15 from 7q33- 2.56 

114472 AA028924 Ks.177407 ESTs; Weakly similar to III! ALU SUBFAMI 2.56 

126291 N42090 yy05b07.r1 Scares melanocyte 2NbHM H sap 156 

113349 T79021 Hs. 14438 ESTs; Moderately simferto histamine N- 156 



105769 AA347485 Hs25477 ESTs; Moderately sim9ar to rtg-1 protei 2.56 

110918 N46423 Hs.24283 ESTs 156 

117170 H98153 Hs.42500 ADP-rfbosylation factor-fike 5 156 

105159 AA173981 Hs.30490 CD2^ssodated protein 155 

105726 AA292328 Hs.9754 activating transcription factor 5 155 

132079 H67964 Hs.33694 ESTs 155 

131813 X51757 Hs.3268 heat shock 70kD protein 6 (HSP70B 1 ) 155 

133538 L14837 Hs.74614 tight junction protein 1 (zona ocdudens 154 

124981 T40849 Hs.1 14034 maternal G10 transcript 154 

122028 AA431306 Hs.98722 ESTs 154 

122487 AA448332 Hs.80598 transcription elongation factor A (Sll); 154 

119315 T41152 Hs.90485 ESTs 154 

107957 AA031948 Hs.57548 ESTs 2.54 

122457 AA447780 Hs.96418 ESTs 154 

103572 Z25749 Hs.75538 ribosomal protein S7 254 

124395 N29963 Hs.193977 ESTs 154 



116024 AA451748 Hs.83883 Human DNA seq from done 718J7 on chromo 

phosphoenotpyiuvate carboxykinase 1 ; ES 253 

134361 043682 Hs.82208 acyt-Coenzyme A dehydrogenase; very long 153 

130420 U60975 Human hybrid receptor gp25 precursor mRN 153 

100336 D63478 Hs.8127 KIAA0144 gene product 153 

105519 AA258063 Hs.23438 ESTs 153 

124684 R02401 Hs.221078 ESTs 153 

105852 AA398933 Hs.172613 solute earner family 12 (potassium/chlo 152 

105012 AA116036 Hs.9329 chromosome 20 open reading frame 1 152 

126534 W39128 Hs.247901 Human DNA seq from clone 8B1 on chromoso 

-CELL MEMBRANE GLYCOPROTEIN PCM; thege 

135334 AA053134 Hs.241558 ariadne-2 (D. metenogaster) homolog (all 252 

128538 R44214 Hs.101189 ESTs 252 

109865 H02566 Hs.191268 H sapiens mRNA; cDNA DKFZp434N174 (from 



HMEC (total RNA), HMEC, LNCaP.ceils 
Lu.SC.H345, LILSC.H69, DU145.ceils 
EB_celis. Rbrablasts2, DU145.ceIls 
Lu.SC.H69, 293T.cells, EBjnb 
DU145.csDs, OVCARjsfls, Lu.SC_H345 
EB.cells, Lu_SC_H69, Lu.SC.H345 
EB.cells, UiJDjm, Fibroblasts 2 
EB.ceDs, DU145_celIs, Caco2 
Fibroblasts 2, PRSC.con, HS578T_cefis 
EB.cells, ULAD.H23, LUC.H460 
Cacol EB.cells, CALU6.cells 
EB.cells, DU145_ceBs, Caoo2 
EB.cells, ULAD.H23, Lu.SC.H69 

EB.cells, LILAD.H23, Lu.SC.H69 
DU145.ceHs, OVCAR_cells, LNCaP.ceils 
DU145.cells, PRSC.con, Fibroblasis2 
LNCaP.ceils, DU145.ceils, HS578T_celis 
EB.cells, MB-MDA-453, Caco2 
LNCaP.celis, EB.cells, MB-MDA-453 
DU145.celb, Cacol OVCARjslis 
HT29 ceils, MB23LceHs, Lu.SQ.H520 
A549.cells, EB cells, HS576T.ceils 
DU145.cells, Lu_SC_H345, EB.cells 
Lu.SC.H345, DU145.cells, LNCaP.celis 
DU145.cells, PC3_cells, A549_ceife 
DU145_cells, OVCAR_cells, LNCaP.celis 
HMEC, DU145_ceQs. DU145_ce0s 
DU145.cells, LNCaP.celis, MCF7 
HS578T.C8HS, DU145.cells ( LNCaP.celis 
PRSC.con, PRSCJog, HS578T.cefls 
Lu.SC.H345, Lu.SC.H69, BT474.celb 
HS578T_cells, PRSC.con, EB.cells 
Lu_SC_H69, Cacol PRSC.con 

EB.cells, Lu_AD_H23, Fibroblasts 2 
LU.SCJH345, 293T.ceHs t Lu.SC.H69 
EB.cells, LNCaPjelb, PC3.ceDs 
A549_ceHs, DU145 ceils, EB.ceDs 
EB.cefls, LuJU}_H23, Lu^AD.358 
EB.cells, LNCaP.ceils, MB-MDA-453 
Lu.SC.H345, Lu.SC.H69, HMEC (total RNA) 
HMEC (total RNA), HMEC, RPW^2 
Lu.SC.H345, Lu.SC.H69, DU145_ceDs 
HMEC, HMEC (total RNA), PC3.ceUs 
HT29_ceto, PRSCJog, Lu.SC_H345 
Lu __AD _H23, RPWE J, LU.SCLH520 
EB.cells, CALU6_celts, DU145.cefe 
OVCARjalls, EB.cells, LNCaP.ceils 
LNCaP.celis, EB.cells, DU145_cete 
MCF7, EB.cells, MB-MJA-453 
EB.cells, DU145.ceIls, HS578T_ceBs 
Lu.AD_H23, MB231.cells. Fibroblasts 2 
DU145_ceHs, Cacol A549.oefls 
EB.cefls, Cacol LNCaP.celis 
Fibroblasts 2, BT474.celb. HMEC (total RNA) 
Lu.SC.H345, MCF7, MB-MDA-453 
Lu.SC.H345, MB-MDA435S, PRSC.con 
A549_ceBs, RPWEJ, DU145.ceBs 
DU145.cells. EB.cells, A549.ceDs 
EB.cells, CALU6_ce0s, DU145_cells 
HMEC (total RNA), HMEC, RPWEJ 

LNCaP.celis, RPWEJ, MB-MDA-453 
LNCaP cells, CALU6.ceIEs, DU145_ceDs 
EB.cells, HMEC (total RNA), Caoo2 
BT474_ceDs, HT29_ceIis, UlAD_358 
EB.cells. Cacol MB-MDA435s 
Lu.SC.H345, OVCARjalb, Lu_SC_H69 
LNCaP.ceils, DU145_cells, EB.cells 
CALU6.cete, Cacol DU145_ce0s 

252 BT474_ceIls, LNCaP.ceils, Lu_AD_H23 
293T.cells, CALU6.ceBs, DU145_cefis 
EB.cells, Lu_AD_H23, Lu_SC_H345 
252 DU145_oells, LNCaP.celis, OVCARjsUs 



93 



WO 02/068677 



PCT/US02/06001 



116579 N68905 small inducible cytokine A5 (RANTES) 2.51 

117590 N34904 ESTs; Moderately similar to ALU SUB 251 

104340 F15201 ESTs Z51 

122455 AA447744 Hs.99141 EST 251 

109339 AA211901 Ks.86430 ESTs 151 

123258 AA490929 Hs.105274 ESTs 251 

118467 N66763 Hs.43080 ESTs 2.51 

106044 AA416546 Hs.149436 klnesln family member 5B 231 

107480 W58057 Hs.74304 periplakin 23 

111760 R26892 Hs^21434 ESTs 25 

132474 N88018 Hs.180930 TBP-assodated factor 172 2.5 

103423 X97249 Hs.123122 FSH primary response (LRPR1; rat) homoto 2.5 

123488 AA599708 Hs.187764 ESTs; Weakly similar to IUI ALU SUBFAMI 2.49 

100475 D90276 Hs.12 carcmoembryontoantigerHelatedceUad 2.49 

112003 R42547 Hs.172551 ESTs 249 

114315 Z41027 Hs.26297 ESTs 2.49 

105291 AA233311 HS.2B752 ESTs 249 

135354 AA188934 Hs.99367 ESTs 2.49 

107521 X78262 Ksaplere mRNA for TRE5 2.49 

108373 AA074393 Hs.61950 ESTs; Weakly similar to nuclear protein 249 

108836 AA132061 Hs.222727 ESTs; Weakly similar to ubiquitous TPR m Z48 

110386 H45516 Hs.33268 ESTs Z48 

129858 M22348 Hs.131255 ubiquinokytochromec reductase binding 2.48 



134283 H12661 Hs.8107 H sapiens mRNA; cONA DKFZp566B0918 (from 



101844 M93425 Hs.62 protein tyrosine phosphatase; non-recept Z48 

133461 M33318 Hs.183584 cytochrome P450; subfamily llA (phenobar 2.48 

103545 214000 Hs.35384 ring finger protein 1 2.47 
128440 N76763 ESTs 247 

134992 H05625 Hs.92414 ESTs 2.47 

116295 AA489016 Hs.91216 ESTs; Highly similar to partial CDS; hum 2.47 

107004 AA598675 H&239475 ESTs Z47 

132137 AA282312 Hs.4076 CTD (carboxy-terminal domain; RNA potyme Z46 

128390 W28286 Hs.100090 tetraspan3 Z46 

113050 T26366 Hs^2711 EST; Weakly similar to 60S RIBOSOMAL PRO 

101667 M60858 Hs.79110 nudeoGn Z46 
108569 AA085398 zn7e3.s1 Stratagene hNT neuron (#937233) 

!MAGE:5467483\mRNAseq 2.45 

117186 H98988 Hs.42612 ESTs Z45 

129091 AA044622 Hs.183755 Human Chromosome 16 BAC done CIT987SK-A 

128468 723625 Hs.258674 EST Z45 

117498 N31726 Hs.44268 ESTs; Highly simQar to myelin gene expr 2.45 

105407 AA243478 Hs.5206 ESTs 2.45 

128941 R55763 Hs.107287 ESTs 2.44 

116486 C14128 Hs.251980 EST Z44 

134869 T35288 Hs.90421 ESTs; Moderately simQar to III] ALU SUB Z44 

130664 R09049 Hs.17625 ESTs Z44 

107985 AA035638 Hs.71968 H sapiens mRNA; cONA DKFZp564F053 (from 

110300 H37820 Hs.124147 ESTs Z44 

113471 T87174 Hs.16341 ESTs; Modefately simQar to llll ALU SUB Z44 

131474 U28749 Hs2726 higrwnobaity group {nonhistone chromoso Z44 

120791 AA342802 Hs.194031 ESTs Z44 

133733 AA416973 Hs.75798 Human DNA seq from done 1183121 on chro 

to predicted fly and worm proteins. Con Z43 

119977 W8B579 Hs.124744 ESTs Z43 

134921 W60186 Hs.1 69487 Kreisler (mouse) mai-related ieudne zip Z43 

132295 H66351 Hs.181042 Dmx-llke1 Z43 

133395 AA491296 Hs.72805 ESTs Z43 

106728 AA465355 Hs.153768 U3 snoRNP-associated 55-kDa protein Z43 

116370 AA521256 Hs336204 ESTs; Moderately simflar to NUCLEAR PORE Z43 

113936 W81552 Hs.B3B23 nu^earre(^torsubfarni}y1;groupl;m Z43 

128862 R61297 Hs.106673 eukaryotic translation initiation factor 2.43 

111614 R12581 Hs.191146 ESTs Z43 

111993 R42241 Hs.106359 ESTs ? Z43 

131554 AA100026 Hs.28669 ESTs; Weakly similar to PROTEIN-TYROSINE 

130983 N71215 Hs.21862 NCK-assoriated protein 1 Z42 

131654 AA497050 Hs.30204 ESTs 2.42 

105014 AA121123 Hs.191374 ESTs 2.42 

106300 AA435840 Hs.19114 hIgh-mobiOty group (nonhistone chromoso 2.42 

102386 U40998 Hs.81728 unc119 (C.elegans) homotog 2.42 

112517 R68589 Hs.23721 ESTs 2.42 
125375 H72971 WAA0277 gene product 2.42 

123808 AA620552 Hs.25682 ESTs; Weakly simflar to PHOSPHATIDYLETHA 

114950 AA243503 Hs.11801 adenosine A2b receptor pseudogene 2.42 

129906 H39216 Hs.239970 ESTs; WeaWy similar to ZNF91L [H.saplen 241 

103408 X95876 Hs.198252 G protein-coupled receptor 9 Z41 

129703 AA401348 Hs.179999 ESTs 2.41 



Lu.SC_H345, LNCaP.cells, Lu.SC_>*69 
Lu.SCJi345, DU145_cells, Lu.SC.H69 
Lu.SC.H345, PRSC.con, PRSCJog 
Caco2, Lu_SC_H69, 293T_ce!ls 
EB.cells, DU145_ceHs, CALUB.ceOs 
EB.cells, LilAD _H23, Lu.SC_H69 
CALU6.cells, HS578T_ceds, OVGAR.ceils 
EB cells, Caco2, DU145 cells 
Caco2, OVCAR.ce!ls, HMEC (total RNA) 
LuJ\D_H23, EB.cells. UlAD_358 
LNCaP cells, EB.cells, DU145.ceBs 
HS57BT.cells, Lu.SC_H345, PC3_celis 
OVCAR_cells, Lu.SC.H345, DU145.cells 
MB-MDM53, 293T_ceDs, CALU6_celis 
EB.cells, Luj\D_H23, Lu_SC_H345 
Lu.SC H69, OVCAR^cefls, UlAD_H23 
EB.cells, CALU6_ceHs, DU145_cells 
MB-MDA-453, Lu_SC.H69, 293T_cells 
Lu.SC.H345, Lu_SC_H69, PRSC.con 
MCF7, MB-MDA453, Lu_SC_H345 
DU145_ce0s, Lu.SC.H345. Lu.SC.H345 
PC3_cefls, OVGARjaBs, Lu.SO.H520 
LNCaP.cells, CALU6.ceIls, PC3_cells 
Z48 HMEC (total RNA), HS578T.cel!s f HMEC 
DU145.cells, EB.celts, CALU6.cells 
EB.cells, hiJD_H22, LuJVD.358 
HT29.cells, Lu.SO.H520. BT474_cefts 
EB.cells, LuJtf).H23, LuJU3.358 
Lu.SC.H345, CALU6_cells, Lu.SC.H69 
MB-MDA-453, 293T.ceIls, MB-MDA-435S 
LNCaP.cells, Caco2, OVCAR_cefls 
Lu_SC_H69, HMEC, EB.cells 
EB.ceDs, DU145.ce0s, LNCaP.ceBs 
Z46 Lu.LC.H460, EB.cells, LilAD.358 
PC3.ceHs, 293T.cells, A549_ceBs 

HT29.ceUs, BT474.ceIls, Lu_SQ_H520 
EB.cells, Lu_AD_H23, UlAD.358 
2.45 EB.cells, LLLAD.H23, LilAD_H23 
\jjJD_H23, EB.cells, Lu_SC_H69 
Lu.SC.H69, DU145.ceBs, OVCAR_ceDs 
EB.cells, 293T.ceJls, PC3.ceBs . 
EB.cells, LNCaP.ceBs, A549_cels 
MB-MDA-435S, HS578T.cete, 293T_cete 
EBLcells, Lu.AD.H23, LilAD.358 
PC3.ceBs, EB.ceQs, A549.ceBs 
244 PRSCcon, PRSCJog, Caco2 
MB-MDA453. Caco2, OVCAFLcete 
CacoZ OVCAR_ceQs, LNCaP.cells 
CALUe.ceHs, OVCAR.ceBs, 293T_ceils 
hiJDjm, LU.SCLH520, PRSC.con 

EB.cells, Caco2, DU145_ceHs 
HT29_ceus, HMEC (total RNA), HMEC 
LNCaP.cells, HS578T_ceHs, MB-MDA-453 
Lu_SCJ*69, BT474_ceHs, U.SCLH520 
EB.cells, LNCaP.cells, OVCAR_ceils 
EB.cells, Lu_AD_H23, PC3_ce0s 
EB.cells, A549.ceBs. 293T.ceHs 
293T.ceils, OVCAR_ceBs, Fibroblasts 2 
EB cells, DU145.ceils, DU145_ceQs 
HMEC (total RNA), Fibroblasts 2 MB-MDA-435S 
A549.cefis, DU145_ceHs, CALU6_cefls 
2.43 EB.ceDs, LNCaP.cells, Caco2 
EB.cells, Caco2, A549_cells 
MCF7, MB-MDA-435S, Lu.SCJi345 
EB.cells, LlAD_H23, Lu_LC.H460 
EB_celIs, Lu.SCJi345 f A549.celb 
OVCAfLcells, EB.cells, DU145.ceIis ' 
Caco2 MCF7, DU145_ceEs 
Lu SC.H345, OVCAR^ceBs, Lu_SC.H69 
242 EB.cells, LuJ0)JH23, Lu.SC.H69 
MB-MDA-453, HT29_ce!ls, UULC.H460 
Lu.SCJ<345, Fibroblasts 2, DU145.cells 
RPWE.2 PRSCJog, Lu_SC.H345 
EB.ceDs, 293T.ceDs, DU145.cel!s 
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105693 AA287104 Hs.181368 U5 snRNP-spedfic protein (220 kD);orth 141 

106532 AA453628 Hs.37443 ESTs 2.41 

132132 AA010933 Hs.4055 core promoter element binding protein 2.41 

111409 R00311 Hs.18798 EST; Weakly similar to till ALU SUBFAMIL 141 

133813 M26557 Hs.250711 dipeptidyl cartjoxypeptidase 1 (angiotens 241 

127240 AA888387 Hs.243845 ESTs; Moderately similar to !!!! ALU SUB 2.41 

104975 AA086071 Hs£0758 chromosome-associated polypeptide C 2.41 

118078 N54321 Hs.47790 EST 2.41 

115640 AA429253 Hs.58103 A kinase (PRKA) anchor protein 9 2.41 

101186 L20298 Hs.179881 ' corfrtlndtng fector, beta subunft 2.4 

113093 T40936 H&8349 ESTs 2.4 

115185 AA259140 Hs.60238 ESTs 14 

113778 W15263 Hs£422 ESTs 14 

128261 AEQ61213 Hs.13179 ESTs; Moderately slmSar to Hi! ALU SUB Z4 

132210 AA235013 Hs.42322 A kinase (PRKA) anchor protein 2 14 

112561 R72427 Hs.129873 ESTs; Weakly similar to CYTOCHROME P450 

127598 AA610677 Hs.168851 ESTs 2.4 

106664 AA460969 Hs.7510 rTritogernicth/ated protein kinase Idnase 2.4 

131367 AA456687 Hs.26057 ESTs 2.4 

103163 X67683 H.sapiens mRNA for keratin 4 2.39 

109639 F04444 Hs.6217 ESTs; Weakly similar to l!ll ALU SUBFAMI 2.39 

112007 R42671 Hs.140853 EST; WeaWy similar to ISIl ALU SUBFAMIL 239 

100023 AFFX control: BbC-3 239 

119923 W86214 Hs.184642 ESTs 239 

127705 AJ0033O7 AJOO3307 Selected chr 21 cDNA library H 239 

130362 AA182658 Hs.179817 DKF2P586F0222 protein 239 

100168 D14874 Hs394 adienomeduffin 239 

134261 AA227678 Hs.8084 Human DNA seq from done 465N24 on chr 1 

Contains two novel genes; ESTs; GSSs an 239 

103392 X94563 H^apiens dbi/acbp gene exon 1 & 2 238 

129688 U81001 Hs.131891 Human SNRPN mRNA; 3* UTR; partial seq 238 

130119 T12649 H&251653 tubuHn; beta; 2 238 

118136 N57710 Hs. 233952 proteasome (prosome; macropain) subunit; 238 

131163 H80107 Hs23754 ESTs 238 

115964 AA446622 Hs.74313 ESTs 238 

135026 H59730 Hs.93231 ESTs 237 

133300 D51401 Hs.70333 ESTs 237 

129948 H69281 Hs.13643 ESTs 237 

112505 R67923 Hs.23368 ESTs 137 

130715 T98227 Hs.171952 ocdudin 237 

12)301 AA192163 Hs.104085 EST 237 

128062 AA379500 Hs.193155 ESTs 237 

127154 AA789101 Hs.198860 ESTs; Weakly similar to !!!! ALU SUBFAMI 237 

102814 U90716 Hs.79187 coxsackie virus and adenovirus receptor 237 

120239 Z41691 Hs.65919 ESTs 237 

106829 AA481883 Hs.31236 ESTs; Weakly similar to Unknown [H^apte 237 

132681 AA435762 Hs.54894 ESTs; Highly similar to unknown [H^pie 237 

108845 AA132946 Hs.68864 ESTs 136 

133226 T85327 Hs.169552 ESTs 136 

106789 AA478726 Hs.26373 ESTs; Moderately similar to !Hi ALU SUB 136 

119236 T10166 Hs.237297 ESTs 136 

106619 AA459255 Hs.23956 ESTs 136 

109178 AA161600 Hs.62741 ESTs 136 

112724 R91753 Hs.17757 ESTs 136 

112655 R85069 Hs.141139 ESTs 136 

132820 AA454988 Hs.57621 ESTs 136 

106155 AA425309 Hs.33287 nuclear factor l/B 136 

114632 AA084742 Hs.194380 ESTs; Weakly stmfiar to llll ALU SUBFAMI 135 

134776 J05582 Hs.89603 mucin 1; transmembrane 135 

101192 L20859 Hs.78452 solute carrier family 20 (phosphate ban 135 

130349 W16686 Hs. 171825 basic heBx-ioop4]efix domain containing 235 

106389 AA446949 Hs.6236 ESTs 235 

109637 F04426 Hs.23131 WnesmfaniilyrrtemberC3 135 

101483 M24466 Hs.76768 procofegen-proline; 2-oxoglutarate <Wi 135 

131751 H18335 Hs.31562 ESTs 135 

131050 X13967 Hs.2250 leukemia Inhibitory factor (cholinergic 135 

130097 N21159 Hs.14845 farkhead box 03A 134 

134533 AA013468 Hs.241493 natural kflter^umor recognition seq 134 

134839 D63479 Hs.1 15907 diacylglycerol kinase; delta (130kD) 134 

115690 AA410894 Hs.44159 ESTs 134 

129079 N91011 Hs.108502 ESTs 134 

123517 AA608525 Hs.243059 EST 234 

126239 AA527215 Hs.75879 ribosomal protein L19 134 

124440 N46435 ESTs 134 

111468 R05809 Hs.205481 ESTs 134 

129560 H18428 Hs.113613 ESTs; Moderately simflarto HI! ALU SUB 134 



293T_celis, CALU6.cells, A549_cells 
EB.cefls, OVCARjaBs, Caco2 
HMEC, HMEC (total RNA), EB.cefls 
Lu.SC.H345, Lu.SC.tf69, PRSC.con 
HT29_cell3, BT474_cells, MB231.cells 
Lu_SCJi345, DU145_cells, LNCaP.cells 
OVCAR.cefl3, DU145_cells, PC3_cells 
EB.ceils, Fibroblasts 1 HMEC (total RNA) 
OVCAR.ceUs, EB.ceils, PC3_ceDs 
EB.ceils, DU145.ceDs, CALU6.oeIls 
Caool HT29.celIs, EB.ceils 
Lq_SC_H69, EB.ceils, Caco2 
Caco2, MB^DA-435s, LNCaP.oells 
DU145.ceQs, LNCaP.ceDs, OVCAfLcells 
Caoo2. DU145_cefis, PRSCJog 
2.4 U.SCLH520, ULAD.H23, EB.ceils 
LNCaP.cells, DU145_ceDs, OVCAR_celis 
OVCAR_cells, 293T cells, A549.ceQs 
EB.ceils, MB-MDA453, 293T_ce(ls 
EB.ceDs, LitAD_H23, LuJU)_358 
EB.ceils, Lu.SC.H345, Lu.SC.H69 
MB-MDA-435s, U_SC_H345, LuJD.H23 
Caco2, Lu_AD.358, LNCaP.cells 
EB.ceils, HS57BTcelIs, DU145_celb 
LilAD.H23, Lu.SC.H345, Lu_LC_H460 
EB.ceils, DU145.ce!ls, PC3.cells 
Fibroblasts 2, Caco2, HS578T.ceHs 

PRSC.con, MB-MDA-453, LNCaP.cells 
EB.ceils, Lu_AD_H23, Lu_SC_H69 
LNCaP.cells, Lu.SC.H69, UUC.H460 
Lu_ADJH23, LuJ.C.H460, Lu_LC.H460 
293T_ceBs, OVCAR.cefls, HS578T_ceiis 
LlAD_H23, Lu.SC.H69, Lu.SC.H345 
EB.ceils, LNCaP.cells, DU145.cells 
EB.ceils, 293T.oells, Lu.SC.H69 
OVCAFLcells, Caco2, CALU6_ce0s 
EB.ceils, LilAD.H23, Lu_SC_H345 
DU145_ceIls, OVGARjsBs, 293T.ce!ls 
Cecal LNCaP.cells, DU145.ceIts 
LuJ\DJH23, EB.ceils, PRSC.con 
EB.ceils, LNCaP.cells, DU145_oelis 
HS578T_ceils, MCF7, Lu_SC_H69 
OVCAR_ce&, DU145_cefls, Lu.SC_H345 
EB.ceils, DU145_ceHs, LNCaP.cells 
EB.ceDs, DU145.ceIIs, OVCAR_ceDs 
EB.ceils, LNCaP.cells, PRSC.con 
Lu_AD_H23, LuJVD.358, U.SCLH520 
Caco2, MB-&EA453, MCF7 
MB-MDA-453, Cacol OVCAR_cells 
EB.ceDs, 293T_ce3ls, LNCaP.cells 
LNCaP.cells, A549.cefe, Caco2 
Lu.SC_H345, LNCaP.cells, EB.ceils 
Caco2, EB.ceDs, DU145_ce!ls 
Fibroblasts! UlAD_H23, Lu_LC_H460 
EB.ceils, OVCARjcelb, HS578T_ce8s 
OVCAR.ceDs, Lu.SC.H345, MB-MDA-453 
Lu.SC.H345, Lu.LC.H460, LuJ\D_H23 
DU145_cells, LuJVD_H23, UlAD.358 
PC3_cefls, CALUB.ceDs, MB-MDA-435s 
A549_ceils, DU145_ceBs, HT29.cefls 
LNCaP.cells, PC3_ceIls, DU145.celb 
MB-MDA-435S, A549_cells. UUCJM60 
PC3.cells, HS57BT.cells, EB.ceDs 
DU145.cells, MB231_ceBs, HMEC 
LUJVD.H23, PC3_cells, PRSCJog 
EB__ce!is, LNCaP.cells, LNCaP.cells 
EB.ceils, HT29.celte, HMEC 
LuJ.C_H460, Caoo2, DU145_ceBs 
PC3.ceDs. EB.ceils, OVCAR_ceIls 
LilAD_H23, Lu_SC_H69, LilAD.358 
Lu.SC.H345, PC3_ceils, MB-MDA-435S 
BT474_cefls, Lu.LC_H460, UiJDJW 
Lu SC H69, HT29_ceIis, MB-MDA435S 
LulAD_H23 f PRSCJog, Lu.SQ_H520 
Lu.SC.H69, Lu.SC.H345, LNCaP.cells 
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104857 AA043219 Ha.19058 ESTs 134 

109647 F04587 Hs.28241 ESTs 134 

117160 H97817 Hs.183302 ESTs 2.34 

112352 R58974 Hs.167343 ESTs 234 

113653 T95745 Hs.187433 ESTs 234 

131606 W56804 Hs29385 AFG3 (ATPase family gene 3; yeast)-tlke 2J34 

101525 M29536 Ks.12163 eukaryotic translation initiation factor 134 

125921 AA775029 Hs.122591 ESTs 233 

125775 AA213555 Hs.29205 alpha Integrin binding protein 63 2.33 

103743 AA126917 Hs.71074 ESTs 2.33 

133735 AC002045 Hs251928 nuclear pore complex Interacting protein 133 

120403 AA234916 H&243851 ESTs 133 

134998 R02207 Hs.92679 ESTs; Weakly similar to microtubuls-base 133 

108456 AA079326 Hs.143654 ESTs 133 

130552 M86667 Hs. 179662 nudeosome assembly protein 1-IB(e 1 133 

111114 N63391 Hs.9238 ESTs 133 

127767 AI269498 Hs.125543 ESTs; Moderately simfiar to TADA1 protei 133 

106546 AA454725 Hs.21056 H sapiens mRNA from chromosome 5q21-22; 233 

122379 AA446110 Hs.250989 EST 233 

133650 D84294 Hs.118174 tetraticopeptide repeat domain 3 133 

106434 AA449099 Hs.8151 ESTs; Weakly similar to atopy related au 133 

105297 AA233451 Hs.183858 trajiscrrptfonal intermediary fector 1 233 

115976 AA447442 Hs.86327 ESTs 23 

105788 AA351031 H&23965 solute carrier famfly 22 (organic anion 133 



113774 W04550 Hs.9927 H sapiens mRNA; cDNA DKFZp564D156 (from 
110617 H68772 H&35820 ESTs; Weakly similar to D34I8.1 [H^apie 132 
102234 U26312 Hs.8123 chromobox homoiog 3 prosophiia HP1 gamml32 



114777 AA151699 Hs.184519 ESTs; Weakly similar to HI! ALU SUBFAMI 132 

125518 R20148 Hs.193851 ESTs 232 

130814 AA256695 Hs.19813 ESTs 2.32 

123473 AA599143 ESTs; Moderately similar to 111! ALU SUB 2.32 

134310 AA313414 Hs.8148 H sapiens done 24856 mRNA seq; complete 2.32 

119192 R85375 H&237262 EST 132 

114391 AA004876 Hs/133100 ESTs 132 

119133 R49144 Hs.119756 ESTs 132 

109710 F09792 Hs.12929 ESTs 132 

116726 F13681 Hs.42309 ESTs 132 

133206 R32993 Hs.6762 ESTs; WeaWy simitar to simflar to teucy 131 

135163 AA125988 Hs.199955 ESTs 131 

111219 N68836 Hs.19247 ESTs 131 

110283 H29565 Hs.12271 ESTs 231 

103772 AA092473 Hs.8123 chromobox homoiog 3 (Drosophfla HP1 gamm231 

122766 AA459386 Hs. 194058 ESTs; WeaWy similar to atypical PKC spe 231 

120886 AA365566 Hs.132736 ESTs; WeaWy similar to allograft inflam 231 

123512 AA600248 Hs.142245 HERV-H LTR-assodating 3 131 

106644 AA460239 Hs.12680 ESTs 131 

127359 H72971 KIAA0277 gene product 131 

105919 AA402494 Hs3990 ESTs 131 

125241 W86291 Hs.121593 ESTs 13 

104624 AA001936 Hs.184721 ESTs 23 

128765 AA101757 Hs.10494 ESTs 23 
108360 AA071539 zm74b6.s1 Stratagene neuroepfthelliim (#9 

HYDROXYSTEROID DEHYDR0GENASE/DELTA-5-DEL 

115682 AA410300 Hs.44618 ESTs 23 

134528 M23161 Hs.84775 Human transposon-like element mRNA 23 

111091 N59858 Hs33032 H sapiens mRNA; cDNA DKFZp434N185 (from 

134044 AA262475 Hs.78746 phosphodiesterase 6A 129 

118229 N62339 Hs.180532 heat shock 90kD protein 1; alpha 129 

110188 H20522 H&20969 ESTs 129 

125073 T87185 Hs.193636 ESTs; Weakly similar to III! ALU CLASS C 129 

111495 R07210 Hs.19913 ESTs 129 

124024 F03077 Hs.106672 ESTs 129 

128230 AA984074 Hs.176757 ESTs 129 

125471 AA477571 Hs.152601 UDP-gtucose ceramide glucosyttransferase 129 

120734 AA299949 EST12545 Uterus tumor I H sapiens cDNA 3 228 

134349 AA406373 Hs.8208 ESTs 2.28 

123412 AA521443 Hs.187763 ESTs 128 

116297 AA489042 Hs.59498 ESTs 128 

104476 N33807 Hs.223014 protease; serine; 15 128 

101004 J04101 H&248109 v-ets avian erythroblastosis virus E26 o 228 

109991 H09813 Hs.12896 KIAA1 034 protein 228 

118934 N92571 Hs£4808 ESTs 2.28 

125096 T94328 Hs.194533 ESTs 128 

117514 N32226 H&124058 ESTs 128 

132792 AA4M903 Hs.242985 hemoglobin; gamma G 128 

129009 AA131421 Hs.107884 ESTs 128 



UlADJ-123, U.SCJH345, Lu.SC.H345 
HS578T.cells, A549.cefls, CALU6_ceIls 
EB.ceBs, Fibroblasts 2, Lu.SC.K69 
EB.celis, Lu_SC_H345 # KT29.ceDs 
MB-MDA435S, MB-MDA453, Lu_SCJH345 
OVCAFLcelis, Fibroblasts 2. MB4UDA-435S 
EB cells, Caccl, DU145.ce!ls 
293T cells, PRSCJog, Lu.SC.H345 
EB cells, DU145.cells, LNCaP.cells 
LuJ\D.H23, LuJ\D_358, LUJ.CJW60 
LNCaP.cells, LiL.SC.H69, DU145_ceI!s 
MB231.ceUs, Lu_SC_H345, Lu.SC.H69 
LNCaP.cells, BT474.cefls, MCF7 
HT29.cells l LuJ\D_H23, RPWEJ 
EB.celis, A549.ceBs, DU145_ceHs 
Cacol EB.celis, MB-MDA-453 
CALUe.cells, 293T_celis, PC3_ce!is 
OVCAR_cete. Caco2, LNCaP.cells 
BT474 cells, Fibroblasts 2, MB-MDA435s 
Lu SC.H345, EB.celis, EB.celis 
EB cells, LNCaP.ceDs, Caco2 
EB.celis, LNCaP.cells, Caco2 
EB.celis, 293T_ceBs, Lu.SC.H69 
EB.celis, LiLADJt23, Lu.SC.H345 
232 OVCAR_cells, EB.ceDs, Lu.SC_H69 
Lu_SC_H345, LuJ\D_H23, PRSC.con 
CALU6_cells, LNCaP.ceDs, A549_ceUs 
HT29.celis, Fibroblasts 2, Lu.SC.H345 
HT29.CBBS, HMEC (total RNA), MB231 .ceils 
MB-MDA-435S, Lu.SC.H69, PRSCJog 
UJCaP.ceQs, DU145.C8lb, Lu.SC_H345 
PC3.celIs, LNCaP.cells, OVCAILcells 
Lu.SC.H69, PRSCJog, PRSC.con 
PC3_cells, 293T.ce0s. 293T.cells 
PRSCJog, 293T_cefls, 293T.cefls 
ULAD.H23, Lu.SC.H69, Lu.SC.H345 
MCF7, BT474.ceDs, MB-MDA-453 
DU145_ceIls, 293T.ce!Is, EB.celis 
Lu.SC.H345, LNCaP.cells, DU145.ceBs 
OVCAR_ce0s, LNCaP.ceBs. 293T.ceDs 
BT474.celis, MB231.cells, MB-MDA-453 
CALUS.ceQs, MCF7, DU145_cells 
HT29.ceHs, BT474_cells, HMEC 
DU145 cells, A549.ceils, Lu.LC.H460 
PC3.cete, 293T_ceils, DU145_ceDs 
HS578T.cells, MB231_ceils, Lu.SQ_H520 
Lu.SC.H345, DU145.ceHs, OVCARjslIs 
HS578T_cefls. DU145.celb, LNCaP.cells 
HMEC, HMEC (total RNA). EB.celis 
DU145.cefls, PC3.cells, PRSCJog 
EB.celis, HMEC (total RNA), Lu.LC.H460 

23 HT29_cete, RPWEJ, LuJVD_H23 
HT29.ce«s, U.SCLH520, Lu^AD_H23 
EB.celis, CALU6_cells, A549 cells 
13 LNCaP.cells, DU145_cells, PRSCJog 
DU145_oeDs, A549_oefls. MCF7 
MCF7, DU145.ceBs, EB.celis 
Fibroblasts 2, MB-MDA435S, Lu_LC_H460 
EB.ceDs, Lu_SC_H345 ( Lu.SC_H69 
CALU6.cells, EB.celis, MCF7 
HS578T„ceBs, RPWEJ2, Lu JVDJ5B 
LNCaP.cefls, DU145_cefc, OVCAR_ceils 
DU145.ceBs, PRSC.con. PRSCJog 
LilAD.H23, Lu_SC_H345, Lu_SCJI69 
DU145_calls, PC3_ceBs, LNCaP.cells 
BT474.ceBs, BT474_ceiis, Lu.SC.H69 
EB.celis, 293T.ce)ls, MB-MDA453 
LNCaP.cells, MCF7, PC3.ceDs 
HT29_celIs, MB-MDA435s. HMEC (total RNA) 
EB.ceOs. CALU6_cells, 293T.cells 
HS578T.cells, 293T_cefls, A549_cells 
Lu.SC.H345, Lu.SC.H69, 293T.celb 
CALU6_cells, HMEC, Lu_AD_H23 
OVCAR.cel)s, LilSC.H69, MCF7 
HS578T.ce!ls, CALU6.celIs. Caco2 
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111858 R16981 Hs.15276 ESTs 228 

112322 R55757 Hs26457 EST 22B 

133477 W69310 Hs.740 PTK2 protein tyrosine kinase 2 228 

132149 T10822 Hs.4095 ESTs 228 

115119 AA256524 Hs.46847 Human DNA seq from done 3QM3 on chromos 

yeast and arcnaea bacterial genes; and 2.27 

102130 U15009 Hs.1575 small nuclearribonucteoproteln D3 polyp 2.27 

114343 Z41424 Hs21259 ESTs 227 

106746 AA476436 Hs.7991 ESTs 227 

119359 T71021 Hs.93334 ESTs; Highly similar to WS baslo-tieDx-l 2.27 

106301 AA435867 Hs.168212 kinesw family member 3B 227 

130260 L13738 Hs.153937 activated p21cdo42Hs kinase 227 

119724 W69468 Hs.47622 ESTs 227 

108960 AA150199 Hs.49378 DKF2P586D0919 protein 227 

103489 Y08614 Hs.79090 exportin 1 (CRM1 ; yeast; homoiog) 226 

107711 AA015736 Hs.220687 ESTs 226 

131950 W64704 Hs.35380 ESTs 2.26 

107093 AA609600 Hs.1Q018 ESTs 226 

113649 T95641 Hs.16400 ESTs; Weakly similar to Hrs [H.sapien$] 226 

105255 AA227498 Hs.3623 ESTs 2.26 

130094 H43286 Hs.167017 gamma^mirtobirtyric acid (GABA) B recepto 226 

111674 R37959 Hs.13358 ESTs 226 

107890 M026030 Hs.61311 ESTs; Weakly similar to CALPAIN 2; LARGE 226 

124628 N74702 Hs. 102834 ESTs 226 

119707 W67569 Hs.44143 ESTs; Weakly similar to SNF2alpha protei 226 

106737 AA470080 H&36237 ESTs; Moderately similar to CGW4 prote 226 

117305 N22798 Hs.43248 EST 226 

134470 X54942 Hs.83758 CDC28 protein kinase 2 226 

130734 T99337 Hs. 18624 K1AA1 052 protein 226 

128561 R69227 Hs.101489 ESTs 226 

100670 HG2992-H Befa-Hexosaminidase, Alpha Polypeptide, 226 

115953 AA443958 Hs.90960 ESTs 226 

129612 H17476 Hs.11615 ESTs; Highly similar to map kinase phosp 225 

111362 N91973 Hs23595 deoxyrfoonuctease ill; dnaQ/mutD (E col 225 

116275 AA485453 Hs250911 interleukin 13 receptor, alpha 1 225 

114461 AA024848 Hs.126705 ESTs 225 

134083 AA278393 Hs79013 ESTs 2.25 

132470 Z24724 Hs.4934 Helens poryA site DNA 225 

114718 AA131328 zo8d1.s1 Stratagene neuroepitheBum NT2R 

SW:COX^MOUSE P45 CYTOCHROME C OXIDASE P 

129499 R40395 Hs242908 tedthin-cholesterol acyttransferasa 225 

124758 R38422 Hs.169168 ESTs 225 

130301 X83127 Hs.172471 potassium voltage-gated channel; shaker- 225 

131263 R38334 Hs24950 regulator of G-protain signalling 5 225 

107159 AA621340 Hs.10600 ESTs;WeaWysimiiartoORFYKR081cIS^ 225 

133262 N72009 Hs.206710 ESTs 224 

132985 AA093619 Hs.62113 K1AA0717 protem 224 

114172 239043 Hs21421 ESTs; Weakly simitar to cysteine desuffu 224 

127847 AA913387 Hs.126717 ESTs 224 

106499 AA452244 Hs.16727 ESTs 224 

105095 AA150088 Hs27023 KIAA0917 protem 224 

108876 AA134361 Hs.191453 ESTs 224 

121971 AA429667 Hs, 120405 ESTs 224 

114334 Z41342 Hsl22941 ESTs 224 

114565 AA063001 Hs.103527 SH2 domain protein 2A 224 

115766 AA421751 Hs.77603 ESTs 224 

130989 AA608546 Hs.21906 ESTs 224 

116304 AA489461 Hs.64742 H sapiens mRNA for K1AA0540 protein; par 224 

111154 N66545 Hs29169 ESTs 224 

105561 AA262881 Hs.16029 ESTs; Weakly similar to alternatively sp 223 

105939 AA404421 Hs.12258 ESTs 223 

126379 AI085342 Hs. 166146 Homer; neuronal immediate early gene; 3 223 

106610 AA45B882 Hs.4832 ESTs; Moderately similar to Lasp-1 prote 223 

132786 AA424545 Hs.56851 H sapiens mRNA expressed in placenta 223 

107206 D20728 Hs.30767 ESTs 223 

133708 R42172 Hs.75667 synaptophysin 223 

135123 AA227567 Hs.9482 targetofmybl (chicken) homoiog 2.23 

132156 AA157401 Hs.4113 S^denosyihomocystelne hydrobse-fike 1 223 

116934 H75624 Hs.39662 ESTs 223 

133660 R87373 ym88e05.r1 Soares adult brain N2MHB55Y 

IMAG&166016 5\ mRNA seq. 223 

119468 W23633 Hs.125043 ESTs 223 

101247 L33801 Hs.78802 glycogen synthase kinase 3 beta 223 



126008 AA253460 zs06f04.s1 NCLCGAP.GCB1 H sapiens cDNA 

122938 AA477119 zu37c7.s1 Soares ovary tumor NbHOT H sap 

TRS288289 G288289 MITOCHONDRIAL D-LOOP 



MB-MDA435s, 293T_ceUs, A549.celis 
Lu_SC_H345, Lu_SC_H69, Lu_AD_358 
EB.cells, PC3.ce0s, DU145.celis 
LNCaP.cells, EB.cells, PC3_ceils 

A549.cells, EB.cefls, LNCaP.ceils 
LNCaP cells, Caco2, EB cells 
HT29_celb, OVCAR.cells7 Fibroblasts 2 
UtAD.358, RPWE_2, UlADJH23 
Lu.SC.H69, 293T.C6US, DU145.cefls 
OVCAR_ce!ls, LNCaP.cefls, EB.cells 
MB-MDA453, DU145_ceiis, DU145_ceils 
PC3_cells, HT29_ceils, A549_cells 
EB.cells, HS578T_ceHs, LuJVD_358 
EB.cells, CALU6_cells, DU145_ceDs 
EB cells, luJD H23, LuJU) 358 
HS578T.cells, OVCAR.celis, MB-MDM355 
LNCaP cells, OVCAR.cel]s, DU145.cells 
ULADJH23, Lu.SC.H69, PRSC.tog 
HS578T_cells, 293T_cefe, Lu.SCJ*345 
F3xobIasts2, MB231.ceHs, 293T_cells 
CALUe.oelb, UcS(LH520, 293T.ceDs 
HT29 ceBs, MB^DA453, PC3_ceils 
293T_cells, CALU6.ceBs, CALU6.cells 
293T_ceBs 1 OVCAR.celis, Lu_SC_H345 
LNCaP.cells, DU145_ceUs, MB-MDA435s 
HT29_cefe, BT474_cells, Fibroblasts 2 
DU145.ceQs, CALU6.ceDs, LliCaP.celis 
LUJVD.H23, Lu.SC.H345, Lu.SC.H69 
Lu SCJH345, DU145.cells. OVCARjalis 
HT29.cefls, BT474_cefls 1 Lu.SC_H345 
Caco2, 293T.cells, DU145_cells 
CALUe.cells, LNCaP.ceDs, PC3_ceite 
U_SQ_H520, Lu JU3.H23, RPWI=_2 
OVCAR.ceIls, 293T.cells, DU145.ceBs 
EB.cells, Lu_AD_H23, Luj4D_H23 
293T_ceBs t EB.cells, OVCARjslls 
EB.oefis, HS578T.cefls, Caco2 

225 MB-MDA-435S, HT29.ceIfe, Lu.SC_fl69 
HMEC (total RNA), Fibroblasts 2, HMEC 
293T_ceHs, RPWS.2, Ui.LC.H460 
EB.cells, OVCAR_ceIls, A549_ce0s 
hiJDjm, EB.cells, Ui.SC.H69 
LNCaP.cells, HMEC, EB.cells 
Lu.SC.H345, DU145.cefls, LNCaP.ceils 
EB.cells, UiJDJ\23, LilAD.358 
293T_cens, CALU6_oells, Lu_SCLH520 
LNCaP_ceBs, DU145.cells, Lu.SC.H69 
Lu.SC.H345, N&MDA-453. Lu_SC.H69 
DU145.cells, LNCaP.cells, CALU6.cstis 
EB.cells, Lu SC.H345, LuJ\D_H23 
ULAD.H23, 293T_ce&, CALU6.ceDs 
DU145_cells, PC3.ceils, EB.cells 
Lu.LC.H460, MCF7, HMEC (total RNA) 
Fibroblasts 2, MB-MDA-435s, MB231_ceSs 
PC3_ceHs, LNCaP.ceDs, DU145.ceIb 
BT474.cel)s, EB.cells, LNCaP.cells 
OVCAR.cells, MB-MDA435s, HMEC 
HS578T_ce!ls, A549_cells, HMEC 
EB.cells, LNCaP.cells, DU145_cefls 
HS578T.cells, PC3_ceHs, RPWE^ 
DU145_ceils, MCF7, Lu.SC.H345 
EB.cells, UlAD_H23, Fibroblasts 2 
BT474.celIs, Fibroblasts 2, MB-MDA435s 
Lu.SC.H345, CALUS.ceto, Lu.SC.H69 
BT474_ce!ls, MB231_cells, EB.cells 
DU145_cells, 293T.celIs, LNCaP.cells 
CALU6_cells, Lu.SC.H345, Lu_LC_H460 

DU145.cells, A549.ceHs, PC3_cefls 
293T_cefls, MB-MDA-453, OVCARjsBs 
LNCaP.cells, EB.cells, MB-MDA435s 
223 HT29.celIs, PRSC.tog, F&robfasts2 

Z23 PC3.oe!ls, MCF7, MB4tfDA-435s 
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114148 Z38804 Hs.164777 ESTs: Moderately similar to OPIOID BINDI 

MOLECULE PRECURSOR [Haptens] 2.23 
103433 X98001 Hs.78948 Rab geranyigeranyttransferase; beta siibu 2.22 
132954 AA027112 Hs.216194 ESTs 122 
133228 N90Q29 Hs.6831 H sapiens done 1400 unknown protein mRN 2.22 
103891 AA242887 Hs.124186 ring finger protein 2 2.22 
124883 R75630 Hs.177242 ESTs 222 
109921 H05734 Hs.30559 ESTs 122 

127306 A130S162 Hs.193687 ESTs 222 

102707 U77456 Hs78103 nudeosome assembly protein 14ike 4 122 

106193 AA427625 Hs.23272 ESTs 122 

118819 N79045 Hs.50800 ESTs; Weakly similar to III! ALU SUBFAMI 122 

134326 U16306 Hs.81800 chondroftin sulfate proteoglycan 2 (vers 222 

112241 R51248 Hs.16027 ESTs 222 

123693 AA609591 Hs.1 12728 ESTs 122 

1290S2 AA496297 Hs.182740 ribosomai protein S11 122 

122481 AA448271 Hs.99126 ESTs 121 

128895 R37753 Hs.106985 ESTs 221 
124691 R05835 Hs.110153 ESTs; Weakly similar to B-CELL GROWTH FA 

131556 AA442853 Hs.2869 cydirHlependent kinase 5; regulatory su 121 

128869 AA424570 Hs.106736 ESTs 121 

107114 AA610089 Hs.11776 U4AJ6-assodated RNA spfidng factor 121 

106255 AA431191 Hs.161489 ESTs 221 

130724 AA370091 Hs.179680 ESTs 22 

105483 AA255874 Hs.23458 ESTs 12 

118970 N93503 Hs.54961 stoned B/TFllA-a!phaJbeta4ike factor 12 

120805 AA346041 Hs.96844 ESTs 12 

108158 AA425382 Hs.6553 ESTs 12 

102121 U14391 Hs.82251 myosinic 12 

109446 AA232125 Hs.87062 ESTs 22 

129515 AA490882 Hs.1 12227 ESTs 12 

113128 T49325 Hs.8977 ESTs 12 

127289 AI041014 Hs.220752 ESTs 12 

129912 AA047344 Hs.107213 ESTs; Highly simflar to NY-REN-6 antigen 12 

115700 AA411685 Hs.67709 ESTs 12 

106267 AA431873 Hs.4988 H sapiens clone 2471 1 mRNAseq 12 

112881 T03593 Hs.182814 ESTs 119 
116902 H70739 yu69f11.s1 Weizmann Olfactory EpitheBum 

IMAGE:239085 3' similar to contains LTR 119 
105621 AA280865 Hs.6375 H sapiens mRNA; cDNA DKFZp564K0222 (from * 

128991 R31652 Hs.821 bigtycan 119 

125466 R08234 Hs.180461 ESTs 119 
108491 AA082973 zn7gU1 StralagenehNT neuron (#937233) 

to gb:M3672 6S RIBOSOMAL PROTEIN L7A (H 

109978 H09356 Hs.22528 ESTs 119 

106990 AA521354 Hs.24758 ESTs 119 

122362 AA443919 Hs.96840 ESTs 119 

125367 AI016490 Hs.81964 SEC24 (S. cerevisiae) rented gene famfl 119 

110716 H97186 Hs.35098 ESTs 119 

129297 R11267 Hs.180570 H sapiens chromosome 19; cosmW F22329 119 

104992 AA102652 Hs.22753 ESTs; WeaWy simitar to coded for by C. 118 

119896 W84738 Hs.137319 ESTs 118 

118594 N69022 Hs.49599 ESTs 118 

129786 H98977 Hs.246109 ESTs 118 

104325 D81608 Hs. 1 50675 polymerase (RNA) II (DNA directed) polyp 118 



123022 AA48D909 aa2Bf10.s1 NCLCGAP.GCB1 H sapiens cDNA 



Alu repetitive etemenfccorrtains element 118 

133572 W94333 Hs.7499 transtocase of inner mitochondrial membr 118 

133363 AA479713 Hs71962 ESTs 118 

135361 AA053319 Hs.167700 ESTs 2.18 

128319 AA808904 Hs.115095 ESTs; WeaWy slmBar to RHO-RELATED GTP- 
DU145.C8DS 

128660 AA011597 Hs.177398 ESTs 118 

114877 AA235618 Hs.205125 ESTs 118 

125925 H28737 ESTs; Moderately similar to fill ALU SUB 118 

113427 T85105 Hs.15471 ESTs 2.18 

117500 N31909 Hs.44278 ESTs 118 

131384 F136Q8 Hs.26226 ESTs 118 

134499 U70370 Hs.84136 paired* homeodomain transcription fa 118 

128154 AA922969 Hs.127100 ESTs 117 

134585 T48154 Hs.168655 H sapiens mRNA for H-2K binding factor-2 117 

104987 AA101723 Hs.16683 ESTs 2.17 

132992 AA091017 Hs.6226 ESTs 117 

135311 M36089 Hs.98493 X-ray repair complementing defective rep 117 

113171 T54613 Hs.9761 EST 117 

117736 N46999 Hs.46648 ESTs 116 



HS57BT_ceDs, Fibroblasts 2, Lu.SC.H345 
LNCaP.cells, EB.ceils, 293T_cells 
EB.ceBs, UlAD_H23, Fibroblasts 2 
293T_ceHs, PC3_ceIls, DU145_ceils 
EB.cells, Lu.SC_H69, Lu.SC.H345 
EB.cells, ULAD.H23, Lu.SC.H345 
Lu_SQ_H520, 293T_cefls, RPWEJ 
MCF7, HT29_cefls, MB-MDA-453 
Caco2, EB.cells, CALU6_celIs 
293T.cells, EB.cells, A549_cells 
Lu_SCH345, Lu.SC.H69, DU145_ceBs 
HS578T ceDs, PRSCJog, CALU6.ceHs 
293T_cefls, HMEC (total RNA), HMEC (total RNA) 
HT29_ceIls, HMEC (total RNA), BT474.celis 
EB.cells, LuJU)_H23, UlAD.358 
UlAD.H23, HTig.cells, Lu_AD.358 
EB.ceils, UiJDjm, LilSC_H345 
2.21 EB.ceils, ULADJ123, UlAD.358 
HT29.C6IJS, LuJ£_H460, UlSC_H69 
EB.ceils, Lu JDH23, Lu.SC.H69 
MCF7, Lu.SC.H345, DU145.cells 
EB.ceils, Cacol DU145.ceIIs 
EB.ceDs, hiJDjm, Lu_SC_H69 
LNCaP.cells, DU145.celis, PC3.ceIls 
293T.cete, HS578T_ce8s, OVCAJLcefls 
HT29.cete, HS578T.ceJis, 293T.ceils 
CALUe.ceBs, PC3_cete, EB.ceils 
A549.ceQs, EB.ceils, Caco2 
HT29_celIs, U1.LCJW6O, CALUS.ceHs 
Lu.SC.H345, BT474_ceUs, Caco2 
Lu.SQ.H520, UUtoJW, Lu.AD.358 
EB.ceils, Lu.AD.H23, UlAD_H23 
CALUe.cells, A549.ceDs, EB.ceDs 
0VCAR.C6US, EB.ceils, Caco2 
Lu.SQ.H520, EB.ceils, PC3_ceUs 
A549.cells, OVCAR.cefls, 293T.ceJls 

LNCaP.cells, DU145.ceIls, PC3_ceBs 
119 Hft/EC, Caool HMEC (total RNA) 
Fibroblasts 2, Lu_SC.H69, HS578T_ceRs 
LitAD.358, ULADJ423, Lu.SQ.H520 

119 LulAD.358, RPWEJ, Lu.LC.H460 
PRSCJog, Lu.SC_H345, Lu.SC_H69 
EB.ceils, LNCaP.cells, OVCARjssQs 
EB.ceDs, LuJ\D.358, PRSC.con 
HT29_ceiJs, Lu.SC_H69, UiJDjm 
DU145.cells, Fibroblasts 2, PRSC.con 
293T.ceDs, M&M)A435s, A549.cells 
MCF7, MB-MDA-453, Lu SCLH520 
293T_celis, 293T.ce0s, OVCAFLcells 
Lu.SC.H69, LLAD.H23, Lu.SC.H345 
293T_ceIls, 293T_cells, 293T ceOs 
PC3_oeHs, Lu.SC.H345, LNCaP.cells 

OVCAR_ceIls, DU145.cells, LNCaP.cells 
Caco2, LNCaP.cells, Lu_SCLH520 
EB.cells, Lu_AD_H23, Fforablasts2 
EB.cells, 293T.celIs, Caco2 
2.18 Lu_SC_H345, OVCAR.cefls, 

EB.cells, Lu.AD.H23, Lu.SQ.H520 
DU145_ceils, 293T.ce0s, OVCAR.ceIls 
Lu.SC.H69, Lu.SC_H345, HS578T cells 
EB.ceDs, UUD.H23, Lu.SC.HS9 
PRSC.con, Lu.SC.H345, PRSC tog 
293T.ceBs, LNCaP.celis, OVCAR.celIs 
Caco2, BT474.ceils, MB231.celIs 
MB-MDA-453, MB-MDA453, Lu.SC.H345 
LNCaP.cells, 293T_ceIls, PRSC.log 
EB.cells, MCF7, DU145.ceiis 
Caccl, LNCaP.cells, DU145 cells 
HMEC (total RNA), Fibroblasts 2, HMEC 
HT29_cells, PRSC.con, LU.SCLH520 
PRSC.log, OVCAR.ceIls, A549_ceDs 
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125181 W58461 Hs.12396 ESTs 2.16 

120187 240251 H&56974 ESTs 2.16 

100308 D50532 Hs.54403 macrophage lectin 2 (calcium dependent) 2.16 

110960 N50887 Hs.26549 ESTs; Weakly similar b KIAA0449 protein 2.16 

113608 733113 ESTs; Moderately simfiar to ALU SUB 2.16 

107538 Z210B9 Hs.50094 ESTs; WeaWy similar to KALIRIN [Rjiorve 2.16 

128703 S76992 Hs.104005 vav 2 oncogene 116 

126065 A1366484 ESTs 2.16 

130000 AA465727 Hs.124084 EST s; Weakly similar to Mil ALU SUBFAMl 2.16 

120407 AA235040 Hs.107283 ESTs 116 

121199 AA400371 Hs.97792 ESTs 2.16 

114963 AA243867 Hs.193055 ESTs 2.16 

100343 D63874 Hs.189509 hfgh-moblfity group (nonhistone chromoso 2.15 

125077 T88822 yd32&s1 Soares fetal liver spleen 1NFL 2.15 

117286 N22181 yw36d12.s1 Morton Fetal Cochlea H sapien 2.15 

132876 AA130603 Hs.169683 ESTs; Moderately similar to III! ALU SUB 2.15 

133834 AA147510 Hs.154737 serine protease; umbilical endothelium 2.15 

126908 AA169866 ESTs; Weakly similar to till ALU SUBFAMl 115 

106900 AA490142 Hs.6193 ESTs 2.15 

129398 AA437374 Hs.234573 H sapiens mRNA for TL132 115 

114512 AA044274 Hs.165215 ESTs 2.15 

134381 U56637 Hs.184270 capping protein (adn filament) muscle 2.15 

118848 N80671 Hs.220255 ESTs 2.14 

115526 AA342049 Hs.69606 ESTs 2.14 

123460 AA598981 Hs.251122 EST 114 

119812 W73951 Hs.58348 ESTs; WeaJdy similar to CORNIFIN A [H.sa 114 

105263 AA227926 Hs.6682 ESTs 114 

129242 W61679 Hs.5174 ribosomal protein S17 2.14 

132348 AA037285 Hs.170311 heterogeneous nuclear ribcnucteoprotein 2.14 

114425 AA015763 Hs.132812 ESTs 114 

1277% A1369384 arytsuffatase D 114 

134069 U29607 Hs.78935 methionine aminopeptidase; dF-2-associa 114 

116158 AA461187 Hs.61762 ESTs 114 

125627 R35166 Hs.14881 ESTs 114 

118684 N71364 Hs.109510 ESTs 114 

119419 797977 Hs.60260 ESTs 2.14 

133097 N57515 Hs.6479 ESTs; Weakly simitar to KIAA0872 protein 114 
112121 R45445 Hs.252723 H sapiens mRNA; cDNA DKFZp434D115 (from 

114894 AA236019 Hs.188803 ESTs 113 

124087 H08773 yI94d5.s1 Soares infant brain 1NIB H sap 113 

111902 R39191 Hs.109445 WAA1020 protein 113 

119943 W86835 Hs.14158 copinelll 113 

109276 AA196306 Hs.86045 ESTs 113 

117351 N24581 Hs.43230 ESTs 113 

116046 AA453461 Hs.94491 H sapiens done 23585 mRNA seq 113 

112785 R96478 Hs.16586 ESTs 113 

115835 AA428576 Hs.41371 ESTs 113 

127499 T49891 Hs.1 19252 tumor protein; transtationally-controlle 113 

129951 AA019475 Hs.74615 ptatelet-denVed growth factor receptor; 113 

124270 H79560 Hs.107840 ESTs 113 

133766 D52420 Hs.184326 ceO division cycle 10 (homologous to CD 112 

109248 AA194720 Hs.189996 ESTs; Highly similarto sec61 homotog [H 112 

106724 AA465226 Hs.28631 ESTs 112 

100571 HG2264-H Atpase, Ca2+ Transporting, Plasma Membra 112 

133017 AA450187 Hs.178518 ESTs 112 

124313 H94650 Hs.108002 ESTs 112 

113059 T26925 Hs.172684 veside-associated membrane protein 8 (e 112 

113241 T63313 Hs226136 ESTs; Moderately similarto ilil ALU SUB 112 

111952 R40782 Hs^1296 ESTs 112 

113965 W86519 Hs.19631 ESTs 112 

108059 AA043944 Hs.62663 ESTs 112 

124235 H63994 Hs.221134 ESTs 112 

106400 AA447621 Hs.31257 ESTs 112 

119590 W44798 Hs.55876 ESTs 112 

112434 R63068 Hs.159793 EST 111 
122731 AA457549 aa92b1^1 Stratagene fetal retina 93722 

gb:X5275jna3 LEUKOSIAUN PRECURSOR (HU 

115348 AA281562 Hs.88860 ESTs 111 

128873 AA226768 Hs.109463 ESTs; Weakly similarto predicted using 111 

133742 T54301 Hs.75844 ESTs 2.11 

102099 U11670 Hs.194778 interieukin 8 receptor; alpha 2.11 

125840 H05787 Hs.12064 ubiquitin specific protease 22 111 

105501 AA256604 Ks.31930 ESTs 11 

111576 R10334 Hs.15489 ESTs 11 

104275 CQ2170 Hs.39387 ESTs; Weakly similar to weak similarity 11 

117803 N48620 r&28483 pregnancy specific beta-1^!ycorjrotein 9 11 



LNCaP.celis, DU145j*2s, 293T_cells 
LNCaP_oeIls. MB4/IDA-453, HMEC (total RNA) 
HT29_ceBs, LilAD_H23, LuJVD_H23 
Caco2, A549.cells, LNCaP.celis 
Lu_SC_H69, CALU6„ceDs, 293T_cells 
HS578T.ceils, 293T_cells, DU145_cells 
RPW^J, Li^SC_H69, HT29_cells 
293T_cete, CALU6.cells, A549„ce)ls 
DU145_cefis, LNCaP.celis, OVCAR_ce0s 
EB.cells, 293T.ce«s, A549_ceils 
LuJ\D.358, [uJDJW, A549_cells 
DU145_celis, PRSC_con, LNCaP.celis 
CALU6_cells, MB-MDA-453, Caco2 
LuJVD_H23, Lu.SC.H69, Lu.SC.H345 
293T.celts, Lu.SC.H345, Lu.SC.H69 
EB.cells, LNCaP.ceBs, HS578T.cells 
DU145 cells, EB cells, Caco2 
OU145lceQs, LNCaP.cefis, OVCAR_C8lls 
Fibroblasts 2, Lu_AD_H23, PRSC.con 
MCF7, DU145.celIs, LNCaP.celis 
LlAD.358, MB4©A-453, HS578T_celb 
LNCaP.celis, EB.cells, PC3_cell3 
EB.ceBs, DU145_cefe, MCF7 
293T.ce&, Caco2, Lu_SC H69 
LlSC_H345, DU145_ceBs, MCF7 
293T_cefe, HS578T_ceils, CALUB.ceOs 
A549_cens, HMEC (total RNA), EB.ce&s 
293T_ceils, CALUB.cefls, HMEC (total RNA) 
A549_ceHs, HT29_celIs, Lu.SQ.H520 
293T.ceBs, HS578T_cefe, PRSC.con 
DU145_ce0s, LNCaP.celis, EB.cells 
Lu.SC.H345, DU145_ceRs, MCF7 
Lu.SC.H69, MCF7, MB-MDA-453 
HT29_ceQs, Ffrrohiasts 2, BT474_ceDs 
OVCAR_cells, PRSC.con, HS578T.ceils 
LALADJ123, Lu.SQ.H520, bi.SQ.H520 
EB.ceBs, LILAD.H23, LuJVD.358 
113 LuJVD_H23, LuJUJ.358, BT474_cefls 
MB-MDA-453, MCF7, U.SCLH520 
Lu.SC.H69, Fibroblasts 2, HMEC (total RNA) 
Cacol 293T.ceiis, Lu_SC_H69 
LNCaP.celis, PC3.ceRs, HS578T.ceHs 
Lu.SC.H345, Lu.SC.H69, LU.LCJW60 
HS578T_ceCs, CALU6_cells, PRSC.con 
LNCaP.celis, Caco2, EB.oalls 
EB_cells, Lu_AD_H23, Lu.SC.H69 
EB.cells, Lu.SC.H345, OVGARjjbDs 
EB.ceBs, PRSC.con, LNCaP.celis 
EB.ceBs, LLACLH23, Lu.SC_H69 
OVCAR_ceils, 293T.celb, 293T_ceHs 
CALU6_ceBs, DU145_ceIls, PC3.ceHs 
HT29.ceUs, MB231.ceils, HMEC (total RNA) 
EB.ceBs, 293T.ceils, DU145.ceBs 
EB.cefls, Lu_AD_H23, Lu_SC.H69 
OVCAR_ceDs, PC3_ceils, 293T.cells 
MB-MDA-453, U.SC.H345. HT29_ceils 
MB-MDA-453, PC3.ceib, LNCaP.celis 
HMEC (total RNA), BT474.celfe, HMEC 
HT29.cells, PC3.ceils, A549_cefis 
PC3.celb, EB.cells, LNCaP.celis 
EB.ceBs, OVCAR.ceBs, 293T.ceUs 
Fibroblasts 2, MB-MDA-453, PRSC con 
DU145.csUs, EB.cells, Caco2 
PRSCtog, Lu.SC.H69. Lu.SC.H345 
HS578T.ceBs, LNCaP.celis, OVCAR.ce(ls 

111 MBM)A-453, RPWEJ, MCF7 
EB.cells, LilAD_H23, Fibroblasts 2 ' 
MB-MDA-435S, EB.cells, LNCaP.celis 
EB ceils, CALU6.ceDs, DU145.ceDs 
LlAD.358, PC3.ceIls, PRSC.con 
EB.cells, LNCaP.celis, Caco2 
Fibroblasts 2, HS578T.cefls, MB-MDA435s 
Lu.SC.H69, PRSCJog, Lu.SC.H345 
HT29_cefls, MB231.ceIls, U.SC.H69 
HT29.celis, HMEC, RPWE _2 
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122725 AA457407 Hs.1 52204 transmembrane protease; serine 2 2.1 

120987 AA398233 Hs.111894 KIM0108 gene product 2.1 

105932 AA403305 Hs.12185 ESTs; Weakly similar to myosin phosphate 2.1 

118398 N64706 Hs.137282 ESTs 2.1 
103679 Z86000 Human DNA seq from PAC 151B14 on chromos 

receptor subtype 3 (SSTR3), tRNA, ESTs, 2.1 

130303 L40392 Hs.180789 H sapiens (clone S164) mRNA; 3' end of c Z1 

122815 AA461080 Hs.139446 ESTs 2.1 

105598 AA279439 H&20594 ESTs; Weakly simitar to misato [D.melano 2.1 

124869 R69088 Hs^8728 ESTs; Weakly similar to F55A12.9 [C.eieg 2.1 

129599 F10720 Hs.180804 ESTs 2.1 

110338 H40359 Hs.177256 ESTs 2.09 

134G92 H17490 Hs.7905 ESTs; Highly similar to sorting nexin 9 2.09 

133002 AF006082 Hs.62461 ARP2 (actin-related protein 2; yeast) ho Z09 

115570 AA398343 Hs.94943 ESTs 2,09 

120055 W93299 Hs.59363 ESTs; Weakly similar to cytokeratin 20 [ 2.09 

116332 AA491208 Hs.62620 ESTs 2.09 

105415 AA243768 Hs.4232 ESTs; Htghry similar to match to ESTs Z4 2.09 

116607 D80354 Hs^56321 EST 2.09 

126731 AA593973 Hs.232217 ESTs; WeaWy simitar to HIS ALU SUBFAMI 2.09 

102276 U30999 Hs.10247 activated leucocyte cell adhesion moleou 2.09 

113666 T96077 Hs.17738 EST 2.09 

101183 L19779 Hs.795 H2Ahlstone family; member O 2.09 

112177 R49025 Hs,22996 ESTs 2.09 

115038 AA252360 H&87968 ESTs 2.08 

109638 F04432 Hs.17904 ESTs 2.08 

109592 F02475 Hs^6370 ESTs 2.08 

133740 U68142 Hs.170160 RAB2; member RAS oncogene tam2y-Bke 2.08 

126716 AA031700 Hs.251962 ESTs 2.08 

124055 F10904 Hs.100516 H sapiens done 23605 mRNA seq 2.08 

113283 T66813 Hs.12947 EST 208 

120097 W95068 Hs£9621 ESTs 2.08 

102066 U08471 Hs.352 folate receptor 3 (gamma) 2.08 
108712 AA121993 zm24d1U1 Stratagene pancreas (#93728) 

simflar to gb:Y433 GLUTATHIONE PEROXJDAS 

134453 X70683 Hs.63484 SRY (sex determining region Y)-box 4 2.08 

103883 AA232836 Hs.87363 ESTs 2,08 

105313 AA233856 Hs.16930 ESTs 2.08 

113669 T96148 Hs.17762 ESTs 2.08 

120380 AA227904 Hs.104223 ESTs 2.08 

121045 AA398554 Hs.181012 doubl^stranded RNA-trfncfing zinc finger 2.08 

104949 AA070735 Hs.146090 ESTs 2.08 

118751 N74210 Hs^0454 EST 2.08 

112399 R60920 Hs.26419 H sapiens done 24510 mRNA seq 2.08 

129994 AA599443 Hs.38194 ESTs; Moderately simiiar to iliJ ALU SUB 2.08 

116402 AA600054 Hs.65302 ESTs 2.08 

125307 Z40583 Hs.101259 ESTs 2.08 

105047 AA132453 Hs.15396 ESTs 2.08 

128659 T95280 Hs.103315 trinucleotide repeat corrtaining 1 2.08 

122301 AA437378 Hs.98791 ESTs 2.08 

121974 AA429804 Hs.229675 EST 2.08 
1 16905 H71420 ys8rils1 Soares fetal fiver spleen 1 NFL 

3' similar to contains Aiu repetitive e 2.08 

106703 AA463979 Hs.21264 KIAA0782 protein Z08 

121908 AA427858 Hs.98534 EST Z07 

135119 T23992 Hs.94769 ESTs; Moderately simiiar to RAS-RELATED 2.07 

103558 Z19574 Hs£785 keratin 17 2.07 

124209 H57317 Hs.193433 ESTs 2.07 

133936 AA045083 Hs.77719 garnma-glutarnyl carboxylase 2.07 

116246 AA479961 Hs.42913 ESTs;Hlc^lysnTnlartoublqultirwx)njug Z07 

123230 AA490134 Hs.105308 EST 2.07 
127378 AA452696 zx39b05.ii Soajes.totaLfetusJib2HF8.9w 

to contains Alu repetitive etementoont 2.07 

110464 H53013 Hs^21901 ESTs 2.07 
135191 X07619 Hs.169876 cytochrome P450; subfamiry UD (debrisoq 

polypeptide 7a (pseudogene) 2.07 

101267 L36818 Hs.75339 inositol polyphosphate phosphatase-like 2.07 

105185 AA191495 Hs.189937 ESTs 2.07 

125366 H60192 Hs.76853 ESTs; Weakly similar to human homolog of Z07 

117472 N30131 Hs.93738 DKFZP434M098 protein £07 

114235 Z39710 H«l25341 ESTs 2.07 

109081 AA165268 Hs.72488 ESTs 2.07 

112596 R78212 Hs.163705 ESTs 2.07 

109254 AA194940 Hs.85956 ESTs; Weakly similar to fine-1 protein O 2.07 

105898 AA401144 Hs.27354 ESTs 2.07 

116290 AA488691 Hs.57969 phenylaianlne-tRNA synthetase 2.08 



Ui.SC.H69, Lu_LC_H460 r Lu_SC.H345 
Fibroblasts 2, PRSC_con, MCF7 
LNCaP.ceils, MCF7, OVCARjalls 
ULSCJH345, HT29_cefls, HMEC 

CALU6_ceHs, A549_ceBs, Lu_SC_H345 
PC3.cefls, DU145_ceils, LNCaR.celIs 
HT29_ceQs, BT474 cells, MB231_ceBs 
EB.ceBs, LU.SCJ1345, LNCaP.cells 
HT29_cells, BT474_ceIls, MB231_cells 
HS578T_ce0s, HT29.cells, HT29.celIs 
MCF7, A549.ceBs, MB-MDA435S 
EB_ce!ls, Fibroblasts 2, HS578T_ceDs 
EB.cells, HS578T ceDs, A549_cefis 
Lu_SCJl345, PC3 cells, LNCaP cells 
HMEC (toy RNA), HS578T_ceBsrHS578T cells 
EB.cells, luJtiJM, Lu.SC.H69 
LNCaP.ceils, Lu_AD_H23, MB«A-453 
LNCaP.ceils, DU145.ce&, RPWEJ* 
MB231 .cells, HT29_ceDs ( HMEC 
PC3.ceI!s, HS578T.ceBs, DU145_ce!ls 
ULAD.H23, LllAD,H23, Lu_SQ_H520 
LNCaP.ceils, MCF7, OVCAR_cefls 
LUjADJi23, Lu_AD.358, Lu.SC.H69 
BT474_ceDs, MB231 cells, HT29.cells 
EB.cells, DU145_ceIls, PC3_oeDs 
Lu_AD_H23, Lu.SO.H520, Lu.LC.H460 
LNCaP.ceils, MB-MDA-453, EB.ceBs 
HS578T_cells, Fibroblasts 2, Lu.SC.H69 
Lu.SC.H345, OVCAR^cefe, DU145.ceBs 
EB.cells, Lu_SC_H69, LuJU).H23 
HS578T_ceHs, A549_ceBs, CALU6.ce0s 
EB.cells, LilADJH23, Lu_AD_358 

2.08 LU.SCLH520, HT29_ceIls. BT474_cels 
EB.cells, Lu_SC.H345, Lu.SC.H69 
HT29.ce0s, 293T_cefls, 293T.ce!ls 
DU145_ceIis, MB-MDA435S, HS578T.cefls 
EB.cells, Lu.SQ_H520, Fibroblasts 2 
293T.cells l CALU6_cells, A549.ceUs 
293T.cells, PC3.ceDs, OVCARjslls 
Lu.SC.K69, Lu.SC.H345, RPWEJ 
LllAD.H23, Lu.SC.H69, Lu_SC_H345 
EB.cells, UlAD_H23, Lu.SC.H69 
DU145_cefls, EB.cells, HS578T.ce0s 
HT29.ceDs, BT474.cetrs, LilAD.H23 
HMEC, HMEC (total RNA), EB.cells 
Caco2, HT^.ceBs, LNCaP.ceils 
EB.cells, LilAD_H23, Lu.SC.H69 
LU.SC.H345. Lu_AD.H23, LuJ\D_358 
HS578T_cells, 293T_cefls, OVCAR.ceBs 

ULAD.H23, EB.cells, PRSC.con 
EB.cells, Caco2, PRSC.con 
293T_cens, Lu.SC.H345, CALU6_celis 
HS578T.ceils, PRSC.con, OVCAR.ceIls 
RPWEJ2. HMEC (total RNA), HMEC 
Fibroblasts 2, OVCARjsBs, 293T.cefls 
Rbrablasts 2, MB^DA-453, PRSC.con 
EB.cells, LNCaP.ceils. LNCaP.cefis 
UJLAD.H23, Lu.SC.H69, Lu.SC.H345 

HS578T.ceIIs, LNCaP.ceils, EB.cells 
Fibroblasts 2, Lu.SQ.H520, LU.SCLH520 

UjAD.H23, Lu_SC_H69, Lu_AD.358 
Lu.SC.H345, OVCAR_cells, Caco2 
Lu.SC.H69, UUD.H23, Lu.SC.H345 
DU145.ceDs, LuJ.CJ+460, Lu_AD_358 
EB.cells, Lu_SC_H69, 293T.celIs 
DU145.cefls, BT474_cells, Lu.SC.H69 
Lu.SC.H69, Lu.SC.H345, PC3.ceDs 
MB-MDA-435S, LU.SQJ1520, MB-MDA-453 
HS578T.cells, 293T_ceIls, OVCAR_ceDs 
EB.cells, 293T_ceBs, PRSC.con 
DJVD.H23, Lu.SC.H345, PRSCJog 
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122529 AA449828 
104612 R99199 
116465 AA621650 
123155 AA488414 

126752 AI073373 
126455 NB0749 
129339 R77869 
115021 AA252028 
129054 T67231 
101281 L35545 
132697 AA281951 
124380 N26536 
103967 AA303711 
119403 T92935 
125755 R66080 
101843 M93405 
113032 T24024 
112563 R72632 
126432 AA583625 
101636 M57763 
125174 W51835 
106168 AA425943 
135343 AA236796 
105267 AA227956 
134331 AA452020 
121634 AA417012 
131394 R72637 
111526 R08260 
125049 T79840 
120433 AA237077 
129498 AA449789 
127805 AA740921 
109275 AA196287 
120683 AA290987 
135415 X60655 
132925 AA252759 
101875 M97287 
101463 M22490 
129177 T95005 
130726 W88946 
105549 AA262417 
124543 N63706 
123062 AA482069 
109464 AA232857 
129619 AA610116 
127545 AA935809 
133068 R73427 
113609 T93263 
106645 AA460270 
126256 Z21124 
129697 R00841 
126730 T19477 
125244 W86466 
134762 M91036 
119564 W38206 
132523 AB002332 
127758 AI337031 
126471 AA158755 
110911 N45120 
122317 AA442742 
100253 D38024 
120431 AA236884 
122449 AA447638 
100961 J00148 
130908 W86389 
102643 U67849 
127932 AA398510 
109207 AA190906 
102598 U62962 
124470 N51702 
104961 AA076672 
124164 H30667 
126468 AA242853 
129683 W05348 
105350 AA235737 



Hs.99229 ESTs 206 

Hs.173063 bansdudn-Qke enhancer of split 2; horn 2.06 

Hs.41045 EST s; WeaWy similar to KIAA0734 protein 2.06 
Hs.76127 heel (homologous to the E6AP (UBE3A) ca 

domain (RLD) 1 2.06 

Hs.183275 ESTs 2.06 

Hs.111515 ESTs; Weakly similar to predicted using 2X6 

Hs.28506 ESTs 206 

Hs.39168 ESTs 206 
Hs.168289 succinate dehydrogenase complex; subunit . 206 

H&82353 endofliallal cell protein C/activated pro 206 
Hs.5518 H sapiens mRNA; cDNA DKFZp566J2146 (from 

Hs.64999 ATPase; Cu^+trareporting; beta poiypept 206 

Hs.144700 ephrfn-B1 2.06 

Hs.1 19908 ESTs; Highly similar to nucleolar prate! 2.06 
Hs.191268 H sapiens mRNA; cDNA DKFZp434N174 (from 
Hs.170008 methytmalonate-serrdaidehyde dehydrogenas 205 

Hs.7387 DKFZP564B1 16 protein 205 

Hs.29282 ESTs 205 

H&235860 ESTs 2.05 

Hs.89474 ADP-ribosyiation factor 6 205 

H&231082 EST 205 
Hs.82208 acyl-Coenzyme A dehydrogenase; very tong 205 

Hs.9914 folBstatin 2.05 

Hs.25348 folOstatin-iike 3 (secreted glycoprotei 2.05 

Hs,234156 ESTs; Weakly similar to CGH2B protein 205 

H&28921 ESTs 2.05 

H&26343 ESTs 205 

Hs20131 ESTs 205 

Hs.111798 ESTs 2.05 
Hs.180777 H sapiens mRNA; cDNA DKFZp554M0264 (from 

Hs.7551 1 connective tissue growth factor 2.05 

Hs.1 197 heat shock 10kD protein 1 (chaperonin 10 205 

Hs.20303 ESTs; Moderately similar to 1111 ALU SUB 205 

Hs.49657 ESTs; Weakly similar to contains similar 204 

Hs.99967 even-skipped homeo box 1 (homolog of Dro 204 

HS238296 DKFZP434A033 protein 2.04 

Hs.74592 spec^ATn1chseqbirKimgpro^n1(b 204 

Hs.68879 bone morphogenetlc protein 4 204 

Hs.209587 ESTs 204 

Hs.1 8508 putative glydne-N-acyltransf erase 204 

Hs.5415 ESTs 204 

Hs.104573 ESTs 204 

Hs.100847 ESTs 2.04 

Hs.87100 ESTs 204 

Hs.1 1663 tetraspanNET-6 protein 204 

Hs.115899 ESTs 204 

Hs235712 ESTs 204 

Hs.16875 ' ESTs; Weakly similar to hypothetical pro 204 

Hs.27695 midline 1 (Opitz/BBB syndrome) 204 
HSAAADNVE TEST1, Human adult Testis tissZ04 

Hs.172069 DKFZP434C212 protein 204 
A1426R Heart H sapiens cDNA clone A1426, 204 

Hs.132756 ESTs; Weakly similar to, KIAA0591 protein 204 

H&242985 hemoglobin; gamma G 204 

Accession not feted in Genbank 204 

Hs.50722 dock (mouse) homolog 2.04 

Hs.180195 ESTs 204 

Hs.175652 ESTs; Weakly sImBar to Ml ALU SUBFAMI 2.04 

HS22305 ESTs 2.03 

Hs.8693 ESTs; Weakly simBar to ill! ALU SUBFAMI 2.03 

Hs247951 Humnfactosc^ulchurneralmuscuterdystro 203 

Hs247323 H sapiens mRNA for G4 protein (G4 gene; 203 

Hs.104977 ESTs 203 

Accession not listed in Genbank 203 

Hs21122 ESTs; Moderately similar to KIAA0438 [H. 203 

Human bel^galactoside alpha2,6-sialyitr 203 

Hs.133146 ESTs 203 

Hs.204692 ESTs 203 

Hs.106673 eukaryotictranslationlnitation factor 2.03 

Hs.101392 ESTs 203 

Hs.33905 ESTs 2.03 

Hs.7535 ESTs; Highly similar to COBW-IIke placen 2.03 

H&237856 ESTs; Moderately similar to cAMP inducib 2.03 

Hs.158196 DKFZP434B103 protein 2.03 

Hs.186571 ATPase; N»»/K+ transporting; alpha 3 pol 2.03 



DU145_cells, HSaT&T.ceOs, 293T_ceIls 
MB-MDA435S, 293T_ceDs, 293T.cete 
MB231_ceDs t HT29.cete, Lu_AD_358 

DU145_ceIis t CALUe.ceBs, PC3.ce0s 

LNCaP.cefls, EB.ceOs, DU145_ceJIs 

CAW6_ceUs, PRSCJog, OVCAR_cete 

EB.cells, BT474_ceBs l LuJU)_H23 

UlSQ_H520 p Fibroblasts 2, EB.cells 

Caco2, LNCaP.cefls, EB celts 

EB.ceOs, RPWEJ, DU145_celis 

206 OVCARjslls, LNCaP.cells, DU145.cells 

08002, Caco2, 293T ceOs 

HT29_cells, HMEC (total RNA), HMEC 

HMEC, EB.ceOs, HMEC (total RNA) 

206 LMCaP.ceils, DU145_ce!ls, OVCAFLceRs 

LNCaP.cells, MB-MDA-453, EB.ceOs 

EB^cells, A549.ceils, A549 ceOs 

MCF7, HS578T_ceQs, PRSC.con 

MB231_ce0s, HT29_cel!s, Fibroblasts 2 

DU145_cells, LNCaP.cells, PC3_ce0s 

EB.ceOs, Fibroblasts 2, Lu_AD_H23 

OVCAR^cete, PC3_ceQs ( EB.cells 

HMEC (total RNA), PC3_cells f HMEC 

HMEC, RPWEJ, HMEC (total RNA) 

EB_cefis, CALUG.ceSs, A549_ceDs 

HS578T.ceDs, EB.ceOs, Lu.SC.H345 

EB.cells, Lu_SC^69, hlJDjm 

LiLADJi23 r Lu.SC.H69, BT474.ce0s 

HT29_ceBs, LUJVD.H23, Lu_SC_H345 

205 DU145_ce!ls, CALU6jse0s, PC3.ceIls 

HS578T.ceIls, PRSCJog, PRSC.con 

DU145.ceiIs, LNCaP.cefls, OVCAR_ceDs 

EB.ceOs, MB-M0A-453, Fibroblasts 2 

UlAD.358, Lu_SCLH520, Lu_LC_H460 

ULADJC3, RPWEJ, LU_SQ_H520 

293T_ceUs, HS570T.cells, LMCaP.cels 

EB.cells, Lu_SC_H69, 293T_ceBs 

PRSC.con, HT29_cete, MB231 .ceils 

293T.ceIls, MB-MDA4353, Lu.SC.H69 

HT29_ceIls, Fibroblasts 2 MB-MDA435s 

DU145.cells, OVCARjaDs, PC3_cefls 

Caco2, 293T.ce0s, DU145_cefc 

LilAD.358, HT29.cells, HT29_cells 

DU145_ceDs, ULAD.H23, LNCaP.cefls 

BT474_ceUs, Caco2, LNCaP.ceBs 

BT474.ceBs, MB-MDA-435S, M&MDM53 

Caco2, OVCAR_ceQs, MCF7 

EB.ceOs, Lu.SC_H345, PRSC.con 

A549_cells, 293T_ce0s, Caco2 

Fibroblasts 2, Fftroblasts 2, MCF7 

HT29.ceIls, Lu_SQJH520, BT474.celis 

EB.ceOs, UlAD_H23, Lu_SC.H69 

EB.ceOs, ULAD.H23, Lu_LC_H460 

MB231.cefls, LuJ\D_358, HT29.cells 

BT474.ceHs > HT29_ceBs, Lu_AD_H23 

PC3_cete, OVCAFLcells, PRSCJog 

293T.ce0s, MB-fVff)A435s, A549_cells 

EB.oelis, LuJU).358, Lu_LC_H460 

UJ\D_H23, RPWEJ, Lu_LC.H460 

EB.cells, Fibroblasts, Lu_SC.H345 

ULADJ123, LuJ^D.358, Lu.SQ.H520 

Lu.SC.H69, EB.cells, Lu.SC.H345 

LU.SCJH345, Lu.SC.H345, U.SQJI520 

HT29.cefls, BT474_ceils, HMEC 

293T.celis, Lu.SCJ^345, OVCAR.ce0s 

HT29_ceils, 293T.ceBs, Lu„SC.H345 

EB.ceOs, Lu_SC_H345, Lu.SC_H69 

Lu.SQ.H520, Lu.SC.H345, Lu_SC.H69 

EB.cells, DU145.ceils, MCF7 

HT29_ceils, Fibroblasts 2, HMEC (total RNA) 

Caco2, LNCaP.cells, EB.cells 

CALU6.cells, CALU6_ceDs, A549.cefls 

MB231 ceils, BT474_ce!ls, RbTOblasts2 

HT29 ceOs, MB-MDA-435s, Lu_ADJi23 

MB-MDA-453, Lu.SQ.H520, LilAD.358 
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129794 AA447772 Hs.14520 eukaryoflc translation Initiation factor 103 

115664 AA405974 Hs.54673 tumor necrosis factor (ligand) superfemi 2.03 

119096 R41672 Hs.91471 ATPase type IV; phospholipid transportirt 2.03 

133866 L36151 Hs.171625 phosphatidyllnositol 4-Wnase; catalytic 2.03 

132055 N69440 Hs.38132 ESTs ' 2.03 

125691 A1034361 Hs.135150 lung type-l cell membrane^ssociatBd gty 2.03 

121376 AA405699 Hs.166232 ESTs; Moderately similar to SODIUM- AND 

TRANSPORTER 2 [H.saptens] 2.03 

105289 AA233178 Hs,103000 WAA0831 protein 2.02 

100967 J02621 Hs.251064 high-mobility group (nonhistone chromoso 102 

124430 N38913 Hs.221575 ESTs 2.02 

128322 AI306331 Hs.133296 ESTs 242 

131077 X91B09 Hs.22698 G alpha Interacting protein 2.02 

108033 AA040923 Hs.92200 K1AA0480 gene product 2.02 

107550 AA001045 Hs.46783 ESTs 102 

109475 AA233159 Hs.87131 ESTs 102 

111400 R00144 Hs.189771 ESTs 102 

117516 N32495 Hs.151560 ESTs 102 

120506 AA257955 Hs.173705 ESTs; Weakly simter to !!!! ALU CLASS C 2.02 

130850 N39306 Hs.20237 DKFZP566C1 34 protein 102 

123118 AA486571 Hs.105696 ESTs; Moderately similar to 11!) ALU SUB 2.02 

111285 N71704 Hs.4310 eukaryotic translation Initiation factor 2.02 

119106 R42362 Hs.91785 ESTs 2.02 

111370 N92915 Hs.94631 brefeldh A-inhibited guanine nucleotide 102 

125013 T67261 Hs.154431 EST s; Weakly similar to neuronal thread 2.02 

129762 AA460273 Hs,12372 K1AA0517 protein 2.02 

120704 AA291970 H&107Q54 WAA0821 protein 2.01 

105355 AA235985 Hs. 26938 Human DNA seq from done 126A5 on chromo 

genes (one with DnaJ domafos); the gene 

famfly member HKR3. Contains ESTs; STSs; 2.01 

125952 AA017723 small Inducible cytokine A5 (RANTES) 2.01 

103478 Y07755 Hs.38991 S100 caldurn-binding protein A2 101 

133544 T33873 Hs.74624 protein tyrosine phosphatase; receptor t 101 
112746 R93237 yq11e10.s1 Scares fetal Ever spleen 1NF 

lMAGE:1966503\mRNAseq. 101 

.116513 N67504 Hs.40061 ESTs 101 

123423 AA598484 Hs.238476 EST 101 
127854 AA769520 ESTs; Weakly similar to REGULATOR OF MFT 
Lu.SQ_K520 

111843 R36969 Hs.18888 ESTs 101 
100221 D28383 Human mRNA for ATP synthase B chain, SU 101 

129966 AA452237 Hs. 194443 ESTs; Weakly similar to BC37295 J [H^ap 101 

106798 AA478968 Hs.20558 ESTs 101 
1 14636 AA085374 zn13d5.s1 Stratagene hfJT neuron (#937233 

gb:L8441 CYTOCHROME C OXIDASE POLYPEPTI 

125348 H21585 Hs.191 277 ESTs; Moderately similar to ATP binding 101 

130620 AA233245 Hs.16773 ESTs 101 

106471 AA450118 Hil25722 ESTs; Weaidy similar to ZINC FWGERPROT 101 

134175 T33128 Hs.7966 ESTs 2 

117291 N22289 yw36g08.s1 Morton Fetal Cochlea H sapien 2 

134199 U47635 Hs.79877 rnyotubularin related protein 6 2 

128758 AA129545 Hs.181165 eukaryotic translation elongation factor 2 

112005 R42569 Hs.22444 ESTs 2 

122521 AA449433 Hs.149227 ESTs; Weakly similar to PROUNE-KICH PR02 

130356 X84373 Hs.155017 nudear receptor interacting protein 1 2 

114067 Z38153 Hs.26921 ESTs 2 

107136 AA620795 Hs.8207 ESTs 2 



EB.celis, UJUD.358, LuJVD_H23 
LuJU>.35B, HT29_ceIls, HT29.cells 
HT29_ceHs, MB231_ceils, BT474_ceIb 
293T.ceBs, DU145_cefls, LNCaP.ceOs 
Lu.SC_.H345, MB-MDA-453, MB4tfDA435s 
Lu_SC_H345, LNCaP.cells, DU145.cells 

LNCaP.cells, HT29.ce»s, RPW£J 
PC3.ceIl9, Lu_AD_H23, MB231.ceBs 
MCF7, DUMS.ceDs, OVCAR_ceiis 
MB-MDA-435S, Fibroblasts 2, EB.celis 
HT29.cells, MB-MDA435S, Lu.SC.H345 
UlAD_H23, RPWEJ, MCF7 
MCF7, Fibroblasts 2, DU145.ceJls 
DU145.cefls, PC3.cefls, OVCARjslls 
KT29_cefls, MB-MDA-435S, Lu_SC_H69 
HT29_ceDs, Fibroblasts 2, HMEC 
KT29_ce0s, HMEC (total RNA), Fibroblasts 2 
MCF7, Ftorobfastsl WCaP.ceBs 
EB.celis, Lu_AD_H23, LlLC_H460 
CALU6 cells, 293T_celte, PRSC.log 
293T_cells, PC3_celis, EB.cefls 
CALU6.cetts, MB-MDA-453, PC3_celb 
EB.celis, OVCAFLceQs, LNCaP.cells 
Lu.SC.H345, Lu_SC_H69 ( PRSC.con 
EB.celis, MB-MDA-435s, OVCARjslls 
Lu.SC.H69, EB.cefls, MB-MDA-453 



LuJVD_H23, Lu.LC.H460, Lu.SQ.H520 
LNCaP.cells, DU145_ceHs, MB231.ceIls 
HMEC (total RNA), HMEC, RPWEJ2 
Lu_SC_H345, BT474_ce0s, HT29_celis 

PC3_celb, LNCaP.cells. OVCARjsOs 
LU.SC.H345, Lu_.SC.H69, PRSC.con 
EB.celis, Lu_AD_H23, Lu_SC.H345 
101 HS578T.ceQs, CALU6_cells, 

Lu_AD_H23, Lu_AD_358, Lu.SQ.H520 
EB.ceDs, LUJULH23, LNCaP.cells 
Lu.SC.H345, Lu.SC.H69, DU145_ceBs 
EB.cefls, UlAD_H23, Lu.LC.H460 

101 EB.celis, CALU6_ceDs, OVCAR^cells 
£8.0683, HS578T.cefls, PC3_ce0s 
EB.celis, DU145.ce0s, 293T_cefls 
OVCAR.cefls, LNCaP.cells, EB.celis 
Lu.SC.H345, F&robiasts2, h)JDJU2 
MB-MDA-453, OVCAR_ce0s, CALU6.cells 
EB ceils, PC3_cells, LNCaP.cells 
Ui_SC_H69, EB.cefls, Lu_SC_H345 
Lu_AD_H23, PRSCJog, hiJDJSB 
HT29.ceQs, RPWE_2, MB231.cells 
DU145.ce8s, PC3.ceHs, MCF7 
293T.ceHs, MB-MDA-435S, HT29_ceJts 
LNCaP.cells, PC3.ceUs, EB.celis 
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Table 3 



Pkey: Unique Bos probeset identifier number 
ExAccn: Exemplar Accession munb 
UnigenelD: Umgene number 
Umgene Tftle: Umgene gene title 



Pkey ExAccn 


UGJD Complete.THie 


Ratio BS/Met 


Top 3 expressing ceo lines 


3D2347 AF039400 


Hs.194659 chloride channel; calcium activated; fam 


19.71. 


EB, NCI-H520, NCMH23 


316304 AI936587 


Hs.221599 ESTs 


14.49 


PRSC.con, RPWE-2, OVCA-R 


339136 


CH2a.rT113D11.GENSCAN.3-1 


1037 


NCI-H69, PRSC.con, NCI-H345 


336171 


CH2a.FGENES.708 3 


9.45 


NCI-H69, NCI-H460, NCI-H23 


338895 


(Ma_DJ32l10.GENSCAN.9-2 


9.31 


PC3.BT474, OVCA-R 


333625 


CH22-FGENES.223J 


8.96 


NCWtt9 f PRSC.con, NCI-H345 


333730 


CH22.FGENES.258J 


8.82 


NCI-H69, BT474, MB-MDA-231 


320244 AA296922 Hs. 129778 gastrointestinal peptide 


8.22 


BT474, CALU6, DU145 


333643 


CH2^FGENES^32_2 


7.66 


MCF7, NCI-H69, LnCep 


333423 


CH22LFGENES.147.3 


7.57 


HT29, MB-MDA-231, EB 


302332 AI833168 


Hs.184507 H sapiens Chromosome 16 BAC done CIT987 


755 MB-MDA-231 , HT29, MB-MDA-* 


333588 


CrC2_FGENES.206.2 


7.46 


HT29, OVCA-R, BT474 


322033 AL137507 


EST cluster (not in UniGene) 


735 


PRSC.con, PRSCJog, NCI-H345 


308601 A1719930 


EST singleton (not h UniGene) with exon 


633 


PC3, DU145, DU145 


339044 


CH22JDA59H18.GENSCAN.27<5 


6.46 


NC1-H69, NCI-H345, PRSCJog 


314516 AA371513 Hs^31748 ESTs 


6.41 


EB, OVCA-R, Caco2 


327805 


CH.05_hsgI)5867968 


628 


NCI-H69, NCI-H345, PRSC.con 


334239 


CH2^GENES^64J 


6.09 


NCI-H520, MB-MDA-435S, MB-MDA453 


332958 


CH2a.FGENES.48J5 


6.04 


NCI-H69, PRSC_con, PRSCJog 


313386 W85772 


Hs.173924 ESTs 


5.68 


MWtfDA-231, OVCA-R, BT474 


314350 AL037927 


Hs.190675 ESTs;Moderatelysimflartol!HALUSUB 


5.84 


OVCA-R, CALU6, EB 


337170 


CH22J r GENES^64-1 


5.67 


LnCap, CALU6, NCI-H69 


337503 


CH22.FGENES.803-1 


5.66 


NCI-H345, PRSC_con, RPWE-2 


337562 


CH2a.C65ElGENSCAN.1-2 


5.53 


HT29, MB-MDA-453, BT474 


337219 


CH22.FGENES.6144 


5.45 


NCI-H69, NCI-H345, PRSCJog 


311331 A1679622 


Hs.32225 immunoglobulin alpha 1 


5.43 


NCI-H69, NCI-H23, NCI-H345 


314251 AA713589 


EST duster (not in UniGene) 


5.41 


PC3,EB,LnCap 


336246 


CH22J r GENES.746_5 


534 


NCI-H69, NCI-H345, PRSCJog 


335009 


CH2a.FGENES.47i.13 


531 


EB, EB,NCI-H69 


339365 


CH2a3A354i12.GENSCAN.34-1 


5.25 


PRSC.con, NCI-H69, PRSCJog 


336088 


CH22_FGENES.688.17 


521 


PRSC.con, Caco2, PRSCJog 


334966 


CH2ajGENES.465J6 


5.16 


DU145.BT474, MB-MDA-231 


334666 


CH23/GENE&418J8 


5.15 


NCW69, NCI-H345, PRSC.tog 


316830 AW182106 Hs.127821 ESTs 


5.12 


NCW345, PRSC.con, PRSCJog 


339413 


CH2ajXi579N16.GENSCAN.54 


5.06 


NCW69.NO-H345, PRSCJog 


337951 


CH22LEMAC005500.GENSCAN.94-1 


5.01 


NCW345, NCI-H69, PRSC.con 


330153 


CR21_p2gi|4325335 


5 


PRSC.con, PRSCJog, NCI-H69 


333987 


CH22_FGENES310.11 


4.96 


M&M)A-231, MB-MDA-453, MB-MDA-453 


334304 


CH22.FGENES373.7 


4.96 


OVCA-R, CALU6, NCI-H23 


338990 


CH22_DA59H18.GENSCAN.^6 


4.95 


PRSCJog, PRSC.con, NCI-H69 


333152 


CH22.FGENES39.1 


4.89 


MB-MDA-435s, OVCA-R, A549 


327049 


CH.21.hsgiJ6531965 


4.87 


PRSC.con, NCI-H345, PRSCJog 


337225 


CH2ajGENES.62&3 


4.83 


niMie f*A\ 1 Ifi CD 

QUI 45, GALUo, as 


333496 


CH22J r GENES.168_6 


431 


NCI-H69, NCI-H345, PRSC.con 


334451 


CH22.FGENES387J1 


4.79 


RPWE-2, PRSC.con, NCW69 


333594 


CH22_FGENES.210_3 


4.78 


OVCA*,PC3,rfT29 


333635 


CH22.FGENES.228.2 


4.78 


NCI-H69, PRSCJog, PRSC.con 


336796 


CH22.FGENES.17M 


4.73 


NCI-H69, NCI-H345, PRSC.tog 


333313 


CH22.FGENES.138.5 


4.72 


NO-H69,NCI-H345, PRSC.tog 


336833 


CH22.FGENES.242-2 


4.7 


NCWH345, NCI-H69, PRSC.con 


336090 


CH2a.FGENES.689J 


4.7 


NCI-H69, PRSC.con, PRSCJog 


336645 


CH22.FGENES26-1 


4.63 


HT29, OVCA-R, DU145 


334565 


CH22.FGENES.4Q5.5 


4.62 


NC1-H345. PRSC.bg, RPWE-2 


333242 


CH22.FGENES.111.6 


4.56 


NCI-H345, PRSC.bg, PRSC.con 


326304 


CH.17Jwgi|5867277 


4.48 


OVCA-R, EB.DU145 


337445 


CH2a.FGENES.7694 


4.47 


RPWE-2, NCI-H69, PRSC.bg 


327413 


CH.02_hsgi|5867750 


4.46 


NC1-H69, PRSCJog, NC1-H345 


327990 


CH.06_hsgi|5868218 


4.44 


PRSC.con, PRSC.bg, RPWE-2 


325038 H38304 


Ha.21782 ESTs 


4.43 


PRSC.con, MB-MDA-231, HT29 


314923 AI732489 


Hs.136370 ESTs 


4.4 


HT29, MB-MDA-231, NCI-358 


328859 


CH.07.hsgi[6381928 


4.4 


OVCA-R, BT474.A549 


334476 


CH22J r GENES394 7 


438 


OVCA-R, PC3,EB 


336092 


CH22_FGENES.689l6 


435 


PRSC.con, Caco2, PRSCJog 


333965 


CH22_FGENES3Q5_3 


435 


NCI-H69.NCW345, PRSCJog 
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niyyi C^CUCO OOO 47 

CHZ2_rGcNto^Z3«17 


A OX 

4.34 




CH22_B/lAOT5500.e^SCAN.91K» 


A 04 

4.33 


337504 


CTO2_FGENES.803-2 


4.33 


00CQ4O 


CH22.FGENES.213-6 


4.33 


338069 


CH22.EMAC005500.GENSCAN.166-14 


4.33 


318538 N28625 


Hs.74034 caveolin 1; caveolae protein; 22kD 


4.31 


333531 


CH2ILFGENES.227.2 


4.3 


302646 M14268 


EST 


4.Z7 


336049 


CH22.FGENES.6B1J 


4.26 


335667 


CH22JGENES.590J8 


4.25 


320352 Y13323 


Hs.145296 dfeintegrin protease 


4.25 


304480 AA430373 


EST singleton (not In UniGene) with axon 


4.22 


327273 


CH.01j5sgi|5867466 


4.22 


334540 


CH22^FGENES.403_5 


4.17 


334719 


CH^GENES^IJO 


4.17 


327827 


CH.05_hs gl|5fi67968 


4.17 


333599 


CH22_FGENES.212_2 


4.17 


329638 


CH.12j)2gI{3779004 


'4.16 


307556 A1281651 


EST singleton (not in UniGene) with axon 


4.16 


336836 


CH22.FGENES.247-11 


4.15 


323187 AL121180 


Hs.240038 ESTs 


4.14 


336397 


CH22LFGENES.823J2 


4.13 


325007 AA736429 


EST cluster (not in UniGene) 


4.13 


300199 AI304386 


Hs.150836 ESTs 


4.11 


335832 


CH2*J r GENES.62D_6 


4.08 


312778 AW31655 


Hs.197919 ESTs 


4.07 


323164 AA765301 


Hs.151858 ESTs 


4.06 


315871 AW135312 Hs.117237 ESTs 


4.05 


337452 


CH22LFGENES.775-1 


4.02 


335265 


CH22JGENES521J 


4.01 


335200 


CH22_FGENES.508_9 


4.01 


336917 


CH22.FGENES.346-4 


3.99 


336564 


CH2£JGENES.847J 


3.98 


333382 


CH22.FGENES.143J1 


3.97 


329436 


•CH.YJtsglj5868883 


3.07 


336929 


CH2*_FGENES.34W 


334 


337238 


CH22J r GENES.641-3 


3.92 


333875 


CH22_FGENES.291.11 


3.92 


337069 


CH2LFGENES.448-2 


3.9 


332491 M24470 


Hs.1435 guanosine monophosphate reductase 


3.86 


304623 AA521331 


EST singleton (not in UniGene) with axon 


3.86 


335348 


CH22_FGENES.537_4 


3.85 


334568 


CH2a.FGENES.405J 


335 


336924 


CH22_FGENES.347-9 


334 


301654 H81795 


EST 


3.84 


334677 


CH^GENESMJO 


333 


326688 


CH.20_hsgij58575B2 


333 


327790 


CH.05Jisgi|5867977 


33 


334591 


CH2LFGENES.408J 


3.8 


337974 


CH22JM^C005500.GENSCAN.10&3 


3.78 


311274 AW293128 Hs.197101 ESTs 


3.78 


326668 


CH.20Jsgi]6552455 


3.78 


304195 N35382 


EST singleton (not in UniGene) with exon 


3.77 


336294 


CH22_FGENES.786_4 


3.77 


311613 AL046311 


Hs.252443 ESTs; Weakly similar to il!l ALU SUBFAMI 


3.76 


338123 


CH22JB4^C005500.GENSCAN.19M 


3.75 


318230 AA558125 


EST duster (not in UniGene) 


3.74 


303985 AW514501 Hs.156110 Immunogfobufin kappa variable 1M 


3.73 


336502 


CHZLFGENES.833J 


3.72 


334063 


CH22_FGENES.327_17 


3.71 


333600 


CH22_FGENESi13J 


3.7 


339424 


CH22JXJ579N16.GENSCAN.14-3 


3.69 


336862 


CH22.FGENES.297-2 


3.67 


334823 


CH22.FGENES.437J5 


3.67 


329940 


CH.16j>2gi]6165199 


3.66 


300275 AI632123 


Hs^31521 ESTs 


3.66 


328820 


CH.07JsgB5868330 


3.66 


332398 AA446446 Hs.104788 H sapiens done 24554 unknown mRNA 


3.66 


325791 


CH.14_hsgi|6682476 


3.65 


300672 R14469 


Hs^56573 ESTS 


3.65 


338344 


CH2?_EMAC005500.GENSCAN.312^ 


3.65 


333257 


CH22JFGENES.118.5 


3.65 


332140 AA620724 


Hs.1 12890 ESTs 


3.65 


337489 


CH22.FGENES.799-2 


3.63 


305167 AA663080 


EST singleton (not in UniGene) with axon 


3.63 


336200 


CH22JGENES.719J 


3.61 
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RPWE-2, HT29, OVCA-R 
OVCA-R, DU145.PC3 
NCI-H345, PRSC.con, PRSCJog 
DU145, HT29, OVGA-R 
NCW69, PRSC.con. NCW345 
PC3,A549,BT474 
OVCA-R, PRSC.con, LnCap 
PRSC.con, PRSCJog, RPWE-2 
HT29,DU145,DU145 
Na-H520 l Caco2,MB-MDA-453 
MBJtfDA-231,DU145,BT474 
NCI^8,NCW460,NCI-H23 
NCUmNCI-H345, PRSC.con 
NOH69.NCI-H345, PRSCJog 
NCI-H69, NCI-H345, RPWE-2 
OVCA-*,NCI-H69,CALU6 
PRSCJog, NCI-H69, PRSC.con 
DU145.MWrfDA-231.HT29 
BT474, HT29, CALU6 
PRSC.con,NCW345 l NC!-H69 
NCI-H345, MB-MDA-435S, RPWE-2 
NCI-H345, PRSC.con. RPWE-2 
NCI-H69, PRSC.con, NCI-H345 
NCI-H345. PRSC.con, PRSCJog 
Nd-H69.NCi4i345.PRSC_!og 
NCW58, NC1-H23, PRSC.con 
NCI-H23.A549.HT29 
MMffl>fc231 f E&MCF7 
PRSC.con, PRSCJog, NCI-H345 
NCI-H69, MCF7, RPWE-2 
NCI-H69, PRSCJog, PRSC.con 
PRSC.con, NC I-H345, PRSCJog 
PRSCJog, PRSC.con, RPWE-2 
EB.A549.HT29 
BT474,PC3,HT29 
NCI-H69, NCI-H345, PRSCJog 
NCI-H69.NCI-H345, PRSCJog 
PRSC.con, RPWE-2, PRSCJog 
NQ-H69, LnCap, RPWE-2 
OVCA-R, MB-MDA-435S, CALU6 
OVCA-fl.DU145.PC3 ■ 
HT29.MB-MDA-231.PC3 
NCI-H69, NCI-H345, PRSCJog 
NCI-H345, PRSC.bg, RPWE-2 
NCW520, LnCap, NCJ-358 
PRSCcon, NCI-H345, NCI-H69 
NCI-H345, PRSC.con, PRSCJog 
PRSC.con, PRSCJog, NCW345 
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PRSC.tag, PRSC.con, NCI-H345 
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M&WDA-231.HT29.BT474 
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NCI-H69, PRSCJog, RPWE-2 
LnCap, NCI-H69,Caco2 
NCW345, PRSC.con, PRSCJog 
NCI-H69, NOH345, PRSC.con 
PRSCJog, PRSC.con, RPWE-2 
PRSC.con, NCI-H69, PRSCJog 
NCI-H345, NCI-H69, RPWE-2 
NC1-H69.NCW345, PRSCJog 
PRSC.con, PRSCJog, RPWE-2 
MB-MDA-231,BT474,BT474 
MB-MDA-453, LnCap, Caoo2 
DU145,EB, OVCA-R 
NCI-H69, NCI-H345, PRSC.con 
BT474, MCF7, EB 
OVCA-REB,MB4vlDA-453 
PRSC_con, NCW69, RPWE-2 
NCI-H69, NC1-H345, PRSC.con 
PRSC.tog, RPWE-2, PRSCjan 
DU145, OVCA-R, PC3 
EB, MB-MDA-453, MCF7 
PRSC.con, PRSCJog, RPWE-2 
3.1 PC3, MB-MDA-453, DU145 
PRSC.con, RPWE-2, PRSCJog 
RPWE-2, NC1-H345, PRSCJog 
EB, CALU6, PC3 
NCWH345, PRSCjan, RPWE-2 
NCI-H69,NCI-H345, PRSCJog 
NCI-H69.NCI-H345, PRSCJog 
LnCap, PRSC.con, DU145 
NCI-H345, PRSC.con, NCW69 
PRSC.con, PRSCJog, NCI-H345 
PRSC_con, PRSCJog, NCW69 
NCW69, PRSC.con, PRSCJog 
PRSCJog, NC1-H69, NC1-H345 
HT29,BT474,MB-M>A-231 
NO-H69.NCW345, PRSC.con 
BT474,MB4/!DA-435s 1 MCF7 
NO-H345, RPWE-2, PRSCJog 
EB,A549,CALU6 
OVCA-R, EB, MB-MDA-453 
NO+169. PRSCJog, PRSC.con 
HT29,BT474,MB-MDA«231 
NCI-H69, NCI-H345, RPWE-2 
PRSCJog, PRSC.con, RPWE-2 
BT474.MB-MDA^31,HT29 
PC3,BT474,MBM)A-231 
PRSCjwn, PRSCJog, RPWE-2 
NCI-H345, RPWE-1 PRSCJog 
EB.DU145.PC3 
MB-MDA-435s, HT29.BT474 
CALU6 r EB,DU145 
NCI-H345, RPWE-2, PRSC.con 
MB-MDA-453, NCI-H345, RPWE-2 
NCI-H345, PRSC.con, NCW469 
NC1-H69, NCI-H345, PRSC.con 
LnCap, NCW345.MCF7 
Caco2,HT29,NCW5B 
NCI-H69, PRSC.con, RPWE-2 
HT29, OVCA-R, A549 
PRSC.con, RPWE-2, PRSCJog 
NCI-H345, NCI-H69, RPWE-2 
Caco2, NCI-H460, A549 
PRSC.con, PRSC.tog, NOH345 
NCI-H69, PRSCJog, NCI-H345 
NCI-H520, NCI-356, MB-MDA-453 
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PRSC.con, NCUH345, PRSCJog 
EB,DU145,MB-MDA-435s 
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NCI-H69, PRSC.con, PRSC.con 
NCI-H345, PRSC.con, PRSCJog 
NCIWNCI-H345, PRSCJog 
NCI+T69, NCI-H345, NCI-358 
PC3, OVCA-R, BT474 
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288 


NCI-H345, PRSC_tog, rnoC„C0n 


329041 


CH.X_hsgi(586&5o4 


286 


Lncap, PRSC.con, RPWt-2 


333541 


CH22J*GENco.17o__o 


2.86 • 


MPI UCQ MPI UQAC DDCP <wn 

NUi-noy, NCI-H045, rRoL_C0n 


337011 


CH22_FGENES.427-6 


2.86 


mpi ucn bdo/* 1m noeo 

nckio9, PRSCtog, PRoC_con 


324031 


AA375646 EST duster (not In UniGene) 


286 


NCJ-H345, PRSCjog, Lncap 


331842 


niiern^ U.AB4M ror* 

AA416586 Hs.98232 ESTs 


2.86 


DU145, OVGA-R, HT29 


336599 


CH22_FGENES^50_3 


2.85 


i »r , n » M/^i uon kiPi uixe 

UlCap, NCK109, NCW1345 


337588 


CH22LC20H1 2.GENSCAN.5-4 


285 


k try 1 unie 1 ijaa nnpn 

NCI-H345, NC1-H69, PRSC.COn 


336177 


CH22_FGENES.712_2 


285 


NCK1D9, PRSCjog, RPWc-2 


337522 


Aiinn F^CklCC 04A 4 

CK22_FGENeS.o19-1 


285 


PAI 1 IC PV/PA D UTIfl 

CALU6, OVCA-R, HT29 


338596 


CH22_ENtAC005500.GENSCAN.437-2 


285 


NCI-H69, PRSC_con, NCI-H345 


309522 


(MM CAAi J 1 1_ ApWCft -w. i 1 t ryi 

AW 150 044 Hs252259 riDOSomal protein S3 


285 


MB-MDA-4D0, MB-MUA-40DS, Md-ML/A-4oOS 


000981 


Cn22J-GcNco.o97-7 


285 


MPl UCO DDCP am DDCP Inn 

NUKioy, PRoCj3Gf1, rRoo_log 


330286 


CH-OojlZ gf{0071910 


2.84 


MPI UOAC DDCP Im MPI UCQ 

Ni»l-no4o, rRoujog, Nirhnoy 


333713 


CH22_FGENES.251J2 


1 Oil 

2.84 


DQiir 0 BDCr> m»m MPI UCO 

RPWc-2, PR5C_00n, NCI-HB9 


335068 


CH22^FGENES.483_5 


283 


MdhwDA-207, NCWi345, RPWc-2 


305075 


AA641288 Hs.181165 euxaryolic translation elongaton factor 


283 


EB, LnCap, DU145 


326380 


CH. 19J13 QI(5oD7027 


2.B2 


MPIJJRQ DDCP mwi DDCP livi 


334970 


CH22_FGENES.466_3 


2.82 


PRSCjCOn, NCkto9, RPWc-2 


337097 


/si ma rr»ri rrr* j*v j ^ 

CrBtLFGEhES^I-l 


2.82 


NCI-n345, NCI-H69, PRSCjog 


323676 


A I7A4S4C C f*T -1, ,-4__ /__! l_ 1 ___\ 

AI702835 EST cluster (not in UniGene) 


2.62 


LnCap, A549, CALUo 


333785 


CH22_FGENES^74_4 


282 


OVCA-R, CaC02, IVB-MDA-453 


334175 


CH22_FGENES349 J 0 


2.81 


nruiir n rrr A~f A linn 

RPWc-2, BT474, MCP7 


444A0F 

337865 


AIJAA Cll,A A ftACeAAr>ClilP R All it c 

CH22 w EMACUD5500.GENSCAN.4&-5 


281 


Caco2, NCm23, di 474 


302585 


AA083564 Hs^49220 H sapiens mRNA for hTbr2; comptets cos 


281 


EB, DU145, MB-MDA-453 


<94eM4 

000023 


Crl22J-GcNcS.4-5 


2.81 


MPI LSOAC DDCP mh klT»l UCQ 

NCI*Ho45, PRSC_C0n, NChH59 


332854 


CHZ2_pGcNcS^2_1 


28 


DDtAfC O DDCP Im. DDCP mm» 

RPWc-2, PRSCjog, PRSC.con 


336978 


CH22_FGENES.384-10 


28 


PRSCjCOn, NCkm45, RPWE-2 


326874 


All M L .ilAMnfM 

CH^0_hsgi|6682507 


2& 


nmiir 0 . i/% 1 uo4c nnM 1 

RPWE-2, NCI41345, PRSC.bg 


315121 


AA565011 Hs.105902 ESTs 


28 


NCI-H345, PRSCjog, RPWE-2 


311185 


A1638294 Hs.224665 ESTs 


28 


NCI-H69, NCht345, PRSC_tog 


334682 


CH22.FGENES.41 9_4 


2.8 


NCI-H69, PRSC^log, RPWE-2 


316845 


AW418715 Hs750388 ESTs 


279 


RPWE-2, NCI-H345, PRSC_log 


331599 


N74626 Hs.50535 ESTs 


2.79 


A549, MB-MDA-453, MB-MDA-435S 


315681 


AW022054 Hs.138591 ESTs 


2.78 


NCI-H460, MB-MDA-453, MCF7 


313012 


AI207390 Hs.143929 ESTs 


2.78 


niuir . , f-v ft try A JM iMv* 

DU145, MB-MDA-453, MCF7 


313476 


AA010267 EST duster (not h UniGene) 


2.78 


NCWi520, NCkMoO, KT29 


327277 


CH.01.nsgi|586 7 473 


Z78 


DU145, CALUo, EB 


310981 


AM94514 Hs.171380 ESTs 


278 


LnCap, RPWc-2, NCI-H4oO 


335090 


aijaa c/^ckicc Ann ** 

CH22 - rGcNco.49U_.1 


277 


Kim uen ddcp i m ddcp mm 
NCI-nD9, PRoCjog, PR5C_C0n 


328581 


CH.07 w hs giJoOOoOoo 


2.77 


HT29, Md^MDA-453, MCF7 


333219 


CH22_FGENcS. 1 04_1 1 


2.77 


kir>i uco dqcp i M kif^i utile 

NC1-HD9, PRSCjog, NCI-H345 


308311 


AI581855 EST singleton (not in UniGene) with exon 


277 


ktn km A 094 UTAft am 1 in 

ftffl^MDA-231, HT29, CALU6 


■*007fifl 
OZS/DU 




977 


CALU6 DU145 EB 


303925 


AW469999 Hs.258523 ESTs 


2.77 


NCm69, LnCap, MBMHDA-231 


337628 


CH22^C20H12.GENSCAN^31 


Z77 


NCf-H69,UCap, MB-MDA-453 


333520 


CH22_FGENES.174.3 


2.77 


NCI-H69,NCI-H345.PRSC.con 


303168 


AA872479 Hs.197770 ESTs;WeaklysirnilartoestrogerKespons 


276 


DU145, OVCA-R, MB-MDA453 


313451 


AW138189 Hs.122672 ESTs 


2.76 


OVCA^,EB,DU145 


328474 


CH.07_hsgi)5868446 


2.76 


NCI-H69, NCI-H345, RPWE-2 


331988 


AA477414 Hs.9242 purine-rich element binding protein B 


Z76 


MB-MDA-435s, A549, OVCA-R 


306180 


AA922503 EST singleton (not m UniGene) with exon 


2.76 


NCI-H69.DU145, LnCap 


321071 


AA013011 H&241502 Cdc42 effector protein 4 


2.76 


PRSCjog, PRSC_con, NCI-H345 


302972 


W73400 EST 


2.76 


NCI-H345, RPWE-2, NCI-H69 


305185 


AA663985 t&248038 major histocompatibility complex; ctess 


2.75 


DU145.A549.BT474 
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Cn22»r6ENcS.65oJ 6 


2.75 


319138 R11699 


Hs.73818 ubiquinol-cytocnrame c reductase hinge p 


2.75 


lid 01 

336387 


CH22_FGENES.82<_7 


2.75 


338054 


CH22_EM^C005500.GENSCAN,158-2 


2.75 


316041 AA719183 


EST duster (not In UnfGene) 


2.74 


336863 


CH2?_FGENES297-4 


2.74 


335975 


CH22.FGENES.652.9 


2.74 


302952 AF103179 


EST 


2.74 


326122 


CH.17Jisgi|5867194 


2.74 


337427 




2.74 


308063 AI469244 


Hs.119252 tumor protein; translationally-oontrblle 


2.74 


325433 


CH.12.hs gl|5856936 


2.74 


316252 AI572633 


Hs.190406 ESTs 


274 


310837 AI418688 


Hs.170301 ESTs 


2.74 


313562 AW467335 Hs.257676 ESTs 


2.74 


335455 


CH22JGENES.5e2.15 


2.74 


304792 AA5B3101 


Hs.29797 ribosomal protein L10 


2.73 


331979 AA469937 Hs.105322 EST 


2.73 


336198 


CH22.FGENES.719J 


2.73 


314698 AI660452 


Hs.187127 ESTs 


2.73 


307954 AI419692 


EST singleton (not in UniGene) with exon 


2.73 


318288 AI088590 


Hs.134702 ESTs 


2.73 


327833 


CH.05JisgQ5867968 


273 


300221 AW449602 Hs.217953 ESTs; Highly slmSar to NK-TUMOR RECOGNI 


326039 


CH.17_hsg!J5867179 


273 


318457 AI149678 


Hs.143952 ESTs 


2.72 


336753 


CH22.FGENES.128-9 


272 


330086 


CH.19_p2gpi5293 


2.72 


333566 


CH22_FGENES.183_2 


2.72 


339384 


CH22_BA232E17.GENSCAN.3-22 


£71 


338668 


CH22JIAAC005500.GENSCAN.465-1 


2.71 


300798 AI382618 


Hs.194613 ESTs 


2.71 


303745 AI142379 


EST 


2.71 


305197 AA666301 


EST singleton (not in UniGene) with exon 


271 


338725 


CH22.EMAC005500.GENSCAN.499-1 


2.7 


307799 AI351112 


EST singleton (not in UniGene) with exon 


27 


309598 AW173642 Hs.250106 EST 


2.69 


302727 L10141 


EST 


2.69 


308544 A1695133 


EST singleton (not in UniGene) with exon 


269 


322877 AA079727 


EST cluster (not in UniGene) 


2.69 


325695 


CH.14_hsgi|6552446 


2.69 


307728 A1335557 


EST singleton (not in UniGene) with exon 


2.68 


302399 N79624 


EST 


2.68 


309343 AW028652 


EST singleton (not in UniGene) with exon 


2.68 


339360 


CH22.BA354I12GENSCAN.322 


268 


337821 


CH22_EM^C005500.GENSCAN.13-11 


2.68 


337338 


CH22J=GENES.717-7 


2.68 


334510 


CH22_FGENES398_8 


268 


300918 AA491286 Hs.128792 ESTs 


268 


335536 


CH22_FGENES.574_2 


267 


335311 


CH22.FGENES.532.4 


267 


338959 


CH22_DJ32J1 O.GEM SCAN .23-31 


267 


339081 


CH22J3A59H18.GENSCAN.37-10 


267 


334058 


CH22_FGENES.327_23 


2.67 


338976 


CH22_DA59H18.GENSCAN.1-3 


266 


325524 


CH.12_hsgp66981 


266 


333069 


CH22.FGBiES.76J 


2.66 


336203 


CH22JGENES.719.7 


266 


333133 


CH22_FGENES.83_9 


266 


304074 T77842 


Hs.142528 ESTs 


2.65 


330919 AA224594 Hs.66941 ESTs 


265 


333248. 


CH22.FGENES.115.5 


2.65 


336655 


CH22.FGENES.42-2 


265 


315322 AA770599 


EST cluster (not in UniGene) 


265 


307474 AI264023 


EST singleton (not in UniGene) with exon 


265 


320221 AL050020 


Hs.127384 DKFZP564C196 protein 


265 


301767 AW361892 


EST 


2.65 


327246 
337403 


CH.01_hsgij5867547 


2.65 


CH22.FGENES.752-2 


2.65 


328221 


CHX6Jsgi]5868099 


2.64 


336759 


CH22JFGENES.133-2 


264 


327532 


CH.02Jisgi|6469818 


264 


305621 AA789095 


EST singleton (not in UniGene) with exon 


264 


322931 AA099329 Hs.151764 ESTs 


264 


327278 


CH.01jBglp867473 


264 


332235 N51413 


Hs.109284 ESTs 


2.64 



NCW69, PRSC.con, RPWE-2 
NCI-H345,NCi-H69, PRSC.con 
PRSC.con, RPWE-2, PRSCJog 
OVCA-R, EB t OU145 
DU145,MCr7,MBM>A453 
MB-MDA453, MCF7, OVCA-R 
CALU6, EB,A549 
CALU6, MB-MDA-435S.BT474 
HT29.Caco2,PC3 
RPWE-2, NOH69, PRSCJog 
NCi-358,NCI-H23,Caco2 
NCI-H345, PRSC.con, RPWE-2 
OVCA-a.MCF7.A549 
NCJ-H345.PRSC.con, RPWE-2 
HT29.MCF7.MB4WDA-231 
NCW^LnCap, PRSC.con 
EB, OVCA-R, MB-MDA-453 
MCF7,BT474,NCW460 
NCWti9, PRSC.con, PRSCJog 
M&M)A-231,LnCap,BT474 
HT29, HT29, EB 

PRSCJog, NCW345.PRSC con 

BT474.PC3.MB4i/IDA-231 

2.73 NO-H520, NC W58, MB4UDA-453 

MB-MDA-453, EB, EB 

PRSC.con, PRSCJog, NCI-H345 

MB-MDA-435S.NCI-H520, MCF7 

HT29,MB-MDA453,MCF7 

HT29.BT474, OVCA-R 

NC1-H69, NCI-H345, PRSCJog 

NCJ-H69, RPWE-2, PRSC_con 

PRSC.con, NCI-H345, PRSCJog 

PRSCJog, PRSC.con, RPWE-2 

EB,NCl-H520,OVCA* 

CALU6,MB4/!DA453,PC3 

HT29,BT474,MCF7 

NCi-358,NCI-H69,NCi-H23 

OVCA-R, BT474.PC3 

HT29, CALU6, MB-MDA435s 

NCI-H345.NCW69, PRSC.con 

NCW69,NCl-H460,NCI-H460 

NCI-H69, PRSCJog, NCI-358 

NCW69.PRSC.oon, NCI-H345 

HT29, MB-MDA-231, MB-MDA-231 

NCI-H69, PRSCJog, PRSC.con 

PRSC.con, PRSCJog, PRSCJog 

NQ-H69, PRSC.con, PRSCJog 

NCi-H460,NCi-H23,Na358 

rV&MDA^35s,CALU6. DU145 

NCW69.NCI-H345, PRSCJog 

MB-MDA-435s 1 Caco2,A549 

NCI-H345, PRSC.con, NCI-H59 

NCI-H345, RPWE-2, NC1-H69 

PRSCcon, RPWE-2 PRSCJog 

PRSC.con, PRSC.bg, RPWE-2 

NCI-H345, RPWE-2, PRSC.con 

NCI-H69, NCJ-H345, PRSC.con 

OVCA-R, PC3.A549 

HT29, OVCA-R, A549 

DU145,CALU6,EB 

PRSC.con, RPWE-2, LnCap 

NCW345, PRSC.con, MB-MDA-231 

NC1-H69, PRSCJog, PRSC.con 

A549, MB-MDA-453, MB-MDA-435s 

NCI-H69, NCW345, RPWE-2 

MB^DA-453,MCF7,HT29 

NCI-H345. PRSC.con, PRSCJog 

EB, OVCA-R, DU145 

PRSC.con, PRSCJog, RPWE-2 

MCF7.MB-JWDA-231,BT474 

NC1-H69, PRSC.tog, PRSC.con 

PC3,CALU6,A549 

HT29, M&M)A-231, MB-MDA-453 

PRSC.con, RPWE-2, NCI-H345 

EB,NCI-H460,NCW69 

DU145.EB, OVCA-R 



108 
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332782 




CH22_FGENES.3_2 


2.63 


312340 


AI862666 Hs.176333 


ESTs 


2.63 


337464 




CH22_FGENES.7954 


2.63 


325783 




CH.14Jlsgl)6456780 


2.63 


303672 


AW5Q2380 Hs^10527 


ESTs 


2.63 


306009 


AA894560 


EST singleton (not in UniGene) with exon 


2.63 


308548 


AI695484 


EST singleton (not in UniGene) with exon 


2.63 


337930 




CH22_EM^C005500.GENSCAN.81-3 


162 


327791 




CH.05Jsgl|5857977 


162 


330925 


AA232678 Hs.87073 


ESTs 


2.62 


327259 




CH.01JJsgit5867454 


162 


302150 


AF061756 Hs.152531 


heart and neural crest derivatives expre 


161 


304881 


AA598501 Hs.195188 


glyceraMehyde-3-phosphate dehydrogenase 2.61 


335956 




CH22J=GENES.647_3 


2.61 


326506 




CH.19_hsgI|5867435 


2.61 


335863 




CH22_FGENES.629J 


2.61 


334752 




CH22_FGENES.428_1 


2.61 


333288 




CH22.FGENES.128J9 


2.61 


306709 


AI024215 Hs.131477 


EST 


2.61 


305816 


AA854776 


EST singleton (not in UniGene) with exon 


2.6 


327264 




CH.01_hsgi|5867461 


2.6 


310905 


AW075527 H&252259 


ribosomaJ protein S3 


2.6 


324492 


AA479507 Hs.135179 


ESTs 


2.6 


322649 


AA526549 


EST duster (not in UniGene) 


2.6 


329384 




CHX-hsgp68869 


2.6 


321240 


M62378 


EST duster (not in UniGene) 


ze 


302751 


AA299576 Hs.156110 


Immunoglobulin kappa variable 1 D-8 


159 


305841 


AA860348 


EST singleton (not In UniGene) with exon 


159 


324180 


AA402242 Hs.122799 


ESTs 


258 


334196 




CH22LFGBK&353.4 


258 


338451 




CH22_EM:AC005500.GENSCAN.35W9 


158 


300333 


AW2973S6 Hs.227052 


ESTs 


158 


305046 


AA632201 


EST singleton (not in UniGene) with exon 


158 


305648 


AA807652 Hs.156110 


Immunoglobulin kappa variable 1D-8 


157 


301744 


W22230 


EST 


157 


329182 




CHXjtsgi|6056331 


157 


318178 


AW137425 Hs.158401 


ESTs 


257 


330057 




CH.17_p2gi|6478962 


157 


326552 




CH.19Jisg]|5867308 


157 


311956 


T67085 Hs.188464 


ESTs 


157 


327185 




CH.0Lhsgil6117805 


157 


302183 


NM_00224 


EST 


157 


327263 




CH.01_hsgij6525274 


156 


339164 




CH22_DA59H18.GENSCAN.6W 


156 


332763 


AA063554 Hs.90959 


ESTs 


156 


330579 


U67733 Hs.154437 


phosphodiesterase 2A; cGMP-stimulated 


155 


329948 




CH.16_p2gij5540101 


255 


300282 


AW044305 Hs.236131 


ESTs; Highly similar to homeodomain-inte 


155 


335448 




CH22LFGEKES.562_5 


2.55 


330959 


H09174 HS26484 


H IRA-interacting protein 3 


155 


307262 


AI202100 


EST singleton (not in UniGene) with exon 


155 


335806 




CH22_FGENES.616_8 


255 


335782 




CH22J=GENES.609_4 


255 


301703 


AW301478 


EST 


155 


329018 




CHXhsgil6249620 


154 


329870 




CH.14_p2gip706435 


154 


334504 




CH22LFGENES39BJ 


154 


304707 


AA564846 


EST singleton (not in UniGene) with exon 


153 


329326 




CRXJisgi|5868806 


153 


334418 




CH22.FGENESJ84.5 


153 


338124 




CH22_EMJ\C005500.GENSCAN.196-2 


253 


318423 


A1362671 Hs.214491 


ESTs 


253 


333006 




CH22.FGENES.60_3 


253 


333668 




CH22_.FGENES.245 2 


253 


333567 




CH22_FGENES.184J 


153 


309592 


AW172384 


EST singleton (not in UniGene) with exon 


252 


328989' 




CH.09_hsgi|5868535 


152 


326725 




•CH.20_hsgiI6552456 


152 


302996 


AF054663 


EST 


152 


335733 




CH22.FGENES.601J 


152 


336000 




CH22_FGENES.658J 


252 


327774 




CH.05_hsgi[5B67964 


152 


328557 




CH.07Jisgi|5868489 


152 


328228 




CH.06_hsgi|5868105 


152 


328305 




CH.07Jisgi]6Q04478 


152 


334010 




CH22LFGENES.313J 


151 



HT29 ( Caco2 f A549 

NCi-358,NCI-358,HT29 

NCI-H69 t NCl-H345,PRSC_con 

EB, OVCA-R, PC3 

PRSCJog, NCI-H345.NCI-H69 

HT29,MB-MDA-231,CALU6 

PC3,A549,NCt^58 

PC3, OVCA-R, MCF7 

PRSCJog, PRSCj»n,NCI-H345 

OVCA-R, MCF7, UCap 

Na-H345,PRSCjx)n, RPWE-2 

OVCA-R, PC3,A549 

M3-MDA435S, NCI-H23, MCF7 

DU145,PRSC_con,PC3 

RPWE-2, NO-H460.NC1-358 

PC3,HT29,NCl-358 

PRSCjxwi, NC KH68 P PRSCJog 

HT29, NCL358,Caco2 

MB-MDA-435s ( MCF7,BT474 

MB4UDA-453, MCF7. M&M5A-435S 

MB-MDA-435S, MB-MDA-435S, MB-MDA453 

OVCA-R, EB,DU145 

DU145, EB, OVCA-R 

PRSCjxwi, RPWE-2, PRSCJog 

NCI-H69,NCI-H345,PRSC_con 

BT474,CALU6,MB^iDA^31 

MCF7,ivBM3A453,OVCA^ 

NCI-H345, PRSCJog, PRSCjon 

EB, PC3, HT29 

NC1-H345, NCI-H69, PRSC.oon 
MBM)A435s,NCI-H23,MCr7 
PRSC_con, PRSCJog, NCW69 
NC1-H460, MB-MDA453, M&MDA-435S 
PRSCjoon, RPWE-2, NC1-H345 
PRSCjxxi, PRSCJog, NC1-H345 
PRSC.con, RPWE-2, NC1-H345 
MB4/DA-231, PRSC.con, BT474 
NCI-H345, RPWE-2. PRSC.con 
NCI-H345, PRSCjoon, RPWE-2 
HT29,MWiflDA453,NCI4M60 
CALU6,HT29,EB 
MCF7.PC3, OVCA-R 
PRSC.con, NCI-H69, PRSCJog 
NC1-H69, PRSC_con, NCI-H345 
RPWE-2, NCI-H345, PRSC.oon 
HT29.CALU6.PC3 
NCW460, MCF7, MBWA-453 
NC1-H460, NCI-H23, NC1-H23 
MB-M3A453.BT474.MCF7 
MWvDA-453, HT29, MCF7 
MCF7,DU145,MB^DA435s 
NCI-H345,NCI-H69 1 PRSC_oon 
Caco2, MB-MDA-453, MB-MDA-435S 
PC3, MCF7,MB4l!DA-453 
NCI-H69, PRSCJog, PRSC.oon 
NCm23,NCm460,NCi^58 
HT29,BT474,MB^DA-231 
NCW520, EB.NOH460 
MB-MDA-231, NCI-H345, NCI-H69 
NCI-H23,NCW58,NCW460 
NCI-H69, PRSC_con, PRSCJog 
OVCA-R. EB.DU145 
NC1-H69, PRSCjan, PRSCJog 
NCI-H69, PRSCJog, PRSCjan 
NCI-H69.NCI-H345, PRSC.ccn 
UCap,NCW69,DU145 
MB-MDA435S, OVCA-R, EB 
PRSC_con.NCI-H345,NCI-H69 
HT29 1 BT474,CALU6 
NCI-H69, PRSCJog, NCW345 
UCap, OVCA-R DU145 
DU145,GALU6,HT29 
MB-MDA-453, MB-MDA-435S, MCF7 
NCI-H69, NC1-H345, PRSC.oon 
NCI-H59,NCWW60, PRSCJog 
NCI-H69, PRSCJog, PRSCjoon 
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339033 CH2^_DA59H18.GENSCAN56-1 151 

335340 CH22_FGENES.535J7 2.51 

300156 AI245582 Hs.233395 ESTs 2.51 

305880 AA866065 Hs.156110 Immunoglobulin kaprra variable 1I>8 25 

310841 Af958009 Hs.232024 ESTs 25 

335908 CH22.FGENES.343-2 25 

304674 AA541735 EST singleton (not In UnlGene) with exon 15 
314521 AW503939 Hs.107149 ESTs; Weakly simitar to PTB-ASSOCIATED S15 

307592 A12B5739 EST singteion (not in UniGene) with exon 15 

331476 N26190 Hs.43768 ESTs 25 

325803 CH.l4Jisgl|6552451 25 

308549 AA993796 EST singleton (not In UniGene) with exon 2.49 

304833 AA586504 EST singleton (not In UniGene) with exon 2.49 

336333 ' CH2£.FGENES.813J 2.49 

332320 T71134 Hs.100551 EST 2.49 

328236 CH.0B_hsgP6B117 2.49 

317335 AI656979 Hs.130210 ESTs 149 

33918B CH22JDA59H18.GENSOW.72-16 2.48 

334235 CH22JGENES.361J9 148 

301214 AW45O950 Hs.157034 ESTs; Weakly similar to Unknown [H^apte 2.48 

332843 CH2^FGENES.19.1 2.48 

337431 CH2Z/GENES.763-7 148 

336757 CH22JGENES.131-1 2.48 

305403 AA723748 EST singleton (not in UniGene) with exon 2.48 

330065 CH.19_p2gP65044 2.48 

309245 AI972447 EST singleton (not In UniGene) with exon 2.48 

328876 CH.07Jtsg!|6525286 2.47 

333944 CH22_FGENES.302 - 2 2.47 

328504 CH.07_hsgiJ5868471 2.47 

338120 CH22.EIWkAC005500.GENSCAN.195-1 2.47 

306710 AIQ24221 EST singleton (not in UniGene) with exon 2.47 

305064 AA636012 EST singleton (not in UniGene) with exon 2.47 

329995 CH.16j>2gi|4567166 2.47 

315694 AI821743. Hs.168418 ESTs; Moderately similar to III! ALU SUB 146 

331004 H64622 Hs.32748 ESTs 2.46 

305259 AA679225 EST singleton (not in UniGene) with exon 2.46 

304576 AA498563 EST singleton (not in UniGene) with exon 2.46 

318887 R60487 Hs.21065 ESTs 2.46 

308954 AI868958 EST singleton (not in UniGene) with exon 2.46 

301140 AI807692 I&207128 ESTs 146 

322085 AA088500 Hs.170298 ESTs 146 

339130 CH22_DA59H18.GENSCAN.564 2.46 

337612 CH22LC20H12.GENSCAN.22^ 146 

313765 AW206181 Hs.165981 ESTs; Weakly similar to gag [Haptens] 2.45 

311665 AW294254 Hs.223742 ESTs 2.45 

328620 CH.07Jisgi|5868241 2.45 

305361 AA708902 EST singleton (not in UniGene) with exon 145 

336243 CH2LFGENES.746J 2.44 

320299 H08323 Hs.177181 ESTs 2.44 

302535 H48676 EST 2.44 

333465 CH2^FGENES.160J2 144 

334109 CH22_FGENES.330 8 ' 144 

301749 F12998 Hs.90790 ESTs 144 

324575 AW502257 EST duster (not in UniGene) 2.44 

337114 CH2^FGENES.494-17 2.44 

336087 CH2?_FGENES.688J6 144 

315678 AI657119 Hs.120036 ESTs 144 

333258 CH22J=GENES.118_6 2.44 

303798 V00505 Hs.36977 hemoglobin; delta 2.44 

309759 AW268822 EST singleton (not in UniGene) with exon 144 

316946 AI122843 EST duster (not in UniGene) 144 

321986 AL133656 EST duster (not In UniGene) 144 

338151 CH2a_EMAC005500.GENSCAN^07-5 144 

327056 CH^1_hsglI6531965 144 

309605 AW182800 EST singleton (not in UniGene) with exon 143 

335783 CH2£_FGENES.610_3 2.43 

325790 CH.14_hsgil6381957 2.43 

339342 CH22_BA354I12.GENSCAN17-10 2.43 

335777 CH22.FGENES.607J3 2.43 

309972 AW450350 Hs.257283 ESTs 2.43 



308716 AI798009 EST singleton (not in UniGene) with exon 2.43 

338087 CH22Jr^AC005500.GENSCAN.174-16 2.43 

306930 AI124518 EST singleton (not In UniGene) with exon 2.43 

319032 AW409728 Hs.80449 ESTs; Weakly similar to cytoplasmic dyne 2.43 

304330 AA157834 EST singleton (not in UniGene) with exon 2.43 

320638 R54766 Hs.101120 ESTs , 2.43 



NCW69.NCI-H345. PRSCjxm 
NCm69.PRSC.con, PRSC.bg 
PRSCjxm, PRSCJog, NCI-H345 
EB, OVCA-R, DU145 
UCap, NCW58, CALU6 
NCI-H345, RPWE-2, PRSCJog 
RPWE-2, NO-H69.MCF7 
NCI-H460,EB,Caco2 
PRSCjwn, NOH345, PRSCJog 
NCI^,NCI-H69. PRSCjxm 
NCI-H345, RPWE-2, PRSC_con 
A549, OVCA-R, CALU6 
MCF7.DU145.UCap 
NCI-W45. PRSCjxm, PRSCJog 
NCI-H345,LnCap, RPWE-2 
PRSCjxm, NCI-H345, PRSCJog 
MCF7, MB-MDA-453, PC3 
NCI-H69. PRSCjxm, PRSCJog 
NCI-H520.NB-MDA-453.A549 
HT29.A549.A549 
DU145,CALU6,EB 
PRSC.con, RPWE-2, NCI-H69 
NCI-H69, PRSCJog, PRSCjxm 
NCI-H23.DU145, OVCA-R 
PRSCjxm, PRSCJog, NCI-H69 
MB-MDA-231, NCI-H69, HT29 
MBM)A-231 I CALU6,PC3 
NCI-H69. RPWE-2. PRSCJog 
LnCap, MB44DA453, MB-MDA435s 
MB4KDA-231, NCI-H69, PRSCjxm 
OVCA-R, EB,LnCap 
NCI-H69, RPWE-2, PRSC_con 
OVCA-R, DU145,M&MDA-453 
EB.A549.UCap 
EB, MCF7, MB-MDA-435s 
PRSCjxm, NCI-H345, RPWE-2 
PRSCjxm, RPWE-2, PRSCJog 
NCI-H345,Caco2,Caco2 
PRSCjxm, PRSCJog, RPWE-2 
OVCA-R,MB^DA-231,HT29 
PRSCJog, PRSCjxm, NCI-H345 
NCI-H345, PRSCjxm, RPWE-2 
EB,A549,Caco2 
RPWE-2, PRSCJog, PRSCjxm 
PRSCJog, RPWE-2, PRSCjxm 
MB-MDA453, MCF7, MB-MDA-435S 
HT29.MB-MDA435s.A549 
OVCA-R, MB-MDA-453, MB«A-435s 
PRSCjxm, RPWE-2, NCI-H345 
MB-MDA-453, EB, DU145 
NCI-H69, PRSCjxm, PRSCJog 
NCWi69, NCI-H345, PRSCJog 
NCW345, RPWE-2, PRSC tog 
NCI-H345, PRSC_con, RPWE-2 
NCI-H69, PRSCJog, PRSCjxm 
PRSCjxm, Caoo2, PRSC jog 
NCW58,PC3,NCm23 
MB-MDA-231, KT29, CALU6 
MB-MDA435S, MCF7, MB-MDA453 
MB-MDA-453, EB, MCF7 
PC3.0VCA4IDU145 
DU145,CALU6,CALU6 
PRSC_con, PRSCJog, RPWE-2 
PRSC_con, NCI-H345, RPWE-2 
NCI-358, NC1-H23, NCI-H520 
PRSCjxm, PRSCJog, NCI-H345 
MB-MDA-435S, MB-MDA-453, MB-MDA-453 
BT474. MB^A-231, MB-MDA453 
DU145, EB.BT474 
MCF7, MB-MDA-453, OVCA-R 
NCI-H345, PRSCjxm, PRSC tog 
DU145, PC3, CALU6 
NCW69,MCF7,BT474 
RPWE-2, A549.NCI-H69 
MB-MDA-453, PC3, OVCA-R 
MCF7, MB4YDA435S, MB-MDA-453 
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335281 CH22JGENES.524J 

317431 A1675790 Hs.132453 ESTs 

308511 AA988891 . EST singleton (not in UnlGene) with exon 

333298 CH22_fGENES.133_4 

328436 Ca07Jisgi|5868417 

333420 ' CH22.FGENES.146J1 

338113 CH2^.EMAC005500.GENSCAN.188-13 

335188 CH22_FGENES.507J 

329164 CHXJisg(|5868691 

336316 CH^GENESTOgjI 

310831 AI927594 Hs.161142 ESTs 

327334 . CH.01Jisgi|5902477 

334017 CH22_FGENES315_2 

308136 AI494446 EST singleton (not In UnlGene) with exon 

333074 CH22JH3ENES.76J0 

306546 AA893109 EST singleton (not in UnlGene) with exon 

336516 CH22J=GENES.836J 

306791 AI042387 EST singleton (not In UniGene) wiih exon 

329411 CKXJlsgi|6682549 

308659 AI750091 EST singleton (not in UniGene) wtth exon 

313504 AJ190405 Hs.143127 ESTs 

326073 CH.17_hsgI|6682495 

334047 CH22_FGENES.326_5 

325464 CH.12J»gl|5866947 

334764 CH22LFGENES.428 13 

312737 AJ033500 Hs.132895 ESTs 

306591 AI000248 EST singleton (not En UnlGene) with exon 

333582 • CH22_FGENES.201_2 

337843 CH22_EMAOT5500.GENSCAN3W 

335284 CH22LFGENES.526 6 

305134 AA653159 EST singleton (not in UniGene) with exon 

335527 CH2Z_FGENES572_7 

336795 CH22_FGENES.176-5 

303144 AF2Q2869 EST 

334948 CH22_FGENES.465J5 

328860 CH.07_hsgiIK81928 ' 

322929 AJ365585 Hs.146246 ESTs 

333561 CH2^FGENES.180J8 

338239 CH2^EMAC005500.GENSCAN^5 

323670 AL04O411 Hs.161763 ESTs; WeaWy similar to K1AA0738 protein 

305903 AA873085 EST singleton (not In UniGene) with exon 

312573 AW297673 Hs. 190526 ESTs 

334470 CH22J=GENES.394_1 

333272 CH22_FGENES.122_1 

304010 AW518383 Hs.177592 rajosomal protein; large; P1 

337316 CH22.FGENES.692-1 

316769 AI914939 Hs.212184 ESTs 

336280 CH22_FGENES.763J 

331223 T98872 Hs.194181 ESTs 

337172 CH22_FGENES.565-2 

300625 AI671992 Hs.143631 ESTs; Weakly similar to WASP-famBy prat 

337092 CH22_FGENES.465-12 

334528 CH22J=GENES.402J 

338411 CH2^.EMAC0O5a)0.GENSCAN.341.7 

331344 AA357927 Hs.70208 ESTs 

334044 CH22_FGENES323_2 

333918 CH22_FGENES.296_7 

317168 AI042614 Hs.125910 ESTs 

333424 CH22LFGENES.147.4 

317779 AW450515 Hs.128381 ESTs 

315142 AJ380577 Hs.190219 ESTs 

310471 AW270515 Hs.149596 ESTs 

325049 AW410339 H&256310 ESTs; WeaWy similar to centaurin beta2 

305234 AA670431 EST singleton (not in UniGene) wtth exon 

337760 CH22.EMACOO0O97.GENSCAN.11W 

311502 AW204380 Hs.208662 ESTs 

337548 , CH22_FGENES.844-5 

326981 CH.21 hsgi|6588016 

309600 AW182066 EST singleton (not in UniGene) with exon 

328936 CH.Q8Jlsgi|5868500 

327937 CH.06_hsgij5B68192 

328282 CH.07Jisgif5868353 



303607 AL046388 Hs.208206 ESTs; Weakly similar to Naf1 alpha prote 

304227 N94974 Hs.75344 ribosomal protein S4; X-Cnked 

314101 AW452279 Hs.257542 ESTs 

325026 AI671168 Hs.12285 ESTs 



1 A'i 

2.43 


PC3, LnCap, A549 


2.43 


NCI-H345, RPWE-2, PRSCJog 


1 A1 

2.43 


OVCA-R, EB P DU145 


O A'i 

2.43 


to, DU145, PC3 


*i A'i 

Z43 


EB, LnCap, A549 


2.43 


NCI-H345, NCI-H69, PRSCJog 


2.42 


DU145, EB, CALU6 


2.42 


EB,A549,BT474 


2.42 


RPWE-2, PRSCjoon, PRSCJog 


2.42 


MB4/IDA-435S, MCF7, NCI-H69 


2.42 


NCI-H345, PRSC.con, PRSC_tog 


2.42 


MB-MDA-453, MB-MDA-435S, MCF7 


242 


PRSC.con, PRSCJog, RPWE-2 


2.42 


OU145, LnCap, EB 


2.42 


NCI-H69, RPWE-2, PRSCJog 


242 


HT29, CALU6, LnCap 


O AO 

2.42 


X iot ■ ir>n nnoA nrir»n i 

NCI-H69, PRSC_con, PRSCJog 


o An 
2.42 


CALU6, DU145, EB 


1 A'i 

2.42 


OVCA-R, EB, LnCap 


2.41 


EB, DU145, CALU6 


2.41 


DU145, EB, CALU6 


4 AA 
2.41 


DU145, A549, MB-MDA-435S 


2.41 


PRSC.con, PRSCJog, NCI-H345 


2.41 


NCI-358, NC1-H23, NCWW60 


2.41 


NCI-H69, NCJ-H345, RPWE-2 


2.41 


OVCA-R, DU145, CALU6 


2.41 


MB-MDA-231, MCF7, DU145 


2.41 


NCI-H69, PRSC_con, PRSCJog 


2.4 


EB, LnCap, A549 


2.4 


NCI-H69, NOH345, PRSCJog 


2.4 


DU145, HT29, MB-MDA-453 


24 


DU145, OVCA-R, EB 


24 


NCJ-H69, NCI-H345, PRSCJog 


2.4 


PRSC.con, PRSCJog, NO-H69 


24 


PRSCjxm, PRSCJog, RPWE-2 


24 


PRSCjan, PRSCJog, NCI-H345 


24 


NCI-H460, A549, HT29 


2.4 


OVCA-R, EB, DU145 


2.4 


NCI-H69, NCI-H345, PRSCjoon 


24 


DU145, MB-MDA-453, EB 


24 


MCF7.A549, NCI-H520 


2.4 


LnCap, NC1-H460, NCI-H23 


24 


NCI-H520, HT29, NCI-H23 


239 


NCI-H345, PRSC_con, RPWE-2 


239 


DU145, CALU6, EB 


239 


MCF7,BT474,OVCA-* 


239 


PRSC_con, NCI-H345, RPWE-2 


239 


NCI-H345, PRSCJog, PRSC con 


239 


DU145,HT29,PC3 


239 


EB, OVCA-R, DU145 


239 


EB,NCi-H520, LnCap 


2.39 


PRSCjan, PRSCJog, NCI-H69 


239 


NCLH345, PRSC.oon, NCI-H69 


239 


NCI-H345, NCI-H69, PRSC_con 


239 


PC3,EB,A549 


238 


MB-MDA-231, MCF7, LnCap 


238 


RPWE-2, NC1-H345, EB 


2.38 


NCI-H345, PRSCjan. RPWE-2 


238 


DU145, MCF7, OVCA-R 


2.38 


EB, DU145, OVCA-R 


238 


OVCA-R, EB.CALU6 


238 


NCI-H460, NCI-H23, NCI-H23 


238 


PRSC_con, RPWE-2, NCM345 


2.38 


M&M)A-453, MB-MDA-231, A549 


L.OQ 


PRQP mn PRSfi Inn RDWC O 


238 


NCI-H345,NCI-H69, LnCap 


238 


MB-MDA453,MCF7,CALU6 


238 


NCI-H345,NCI-H69,PRSC con 


237 


RPWE-2, NCI-358, NCI-H69 


2.37 


OVCA-R, MB-MDA-453, CAUU6 


237 


BT474.EB, OVCA-R 


237 


DU145, CALU6, CALU6 


237 


LnCap, PRSCJog, NC1-H345 


237 


EB, PC3, OVCA-R 


237 


OVCA-K.CALU6,CALU6 


237 


NCI-H345, PRSC.con, PRSCJog 
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315015 AI659989 Hs.132625 ESTs 2.37 

328662 CH.07J1S gIJ6004473 137 

305867 AA664572 EST singleton (not in UnlGene) with exon 237 

333296 CH22LFGENES.132L3 257 

331070 R01116 Hs.182059 ESTs 236 

333698 CH2Z/GENES.250J2 236 

316423 AA758756 Hs.121380 ESTs 2.36 

323189 AL121194 Hs.120589 ESTs 2.36 

318889 Z43296 Hs.18720 programmed cell death 8 (apoptosis4nduc 2.36 

334237 CH22.FGENES.362.1 2.36 

315931 A1700148 Hs.117328 ESTs 2.36 

326884 CR20Jtsgi|6682511 256 

333132 CH22.FGENES.83_8 236 

306574 AA995719 Hs.76067 heat shock 27kD protein 1 236 

324416 AI669524 Hs.194115 ESTs 256 

329496 CH.10_p2gi|3983518 235 

320994 H22381 EST duster (not In UnlGene) 235 

320481 AA461139 Hs.24372 ESTs; Weakly similar to dJ207H1 .1 |H.sap 255 

309958 AW444488 EST singteton (not in UnlGene) with exon 2.35 

327009 CR21_hsgiJ5867664 235 

309594 AW172821 Hs.181165 eukaryotic translation elongation factor 235 

335468 CH22-FGENES.567.4 235 

304269 AA069Q29 EST singleton (not In UniGene) with exon 255 

305877 AA865649 EST singleton (not in UnlGene) with exon 255 

' 305700 AA815428 EST singleton (not In UniGene) with exon 255 

326423 CH.19Jisgi|5867369 254 

334560 CH22_FGENES.404_3 234 

337100 CH22_FGENES.472^3 234 

301505 AWD14374 Hs.144849 ESTs 254 

312142 AW288359 Hs.221069 ESTs 234 

305787 AA845035 EST singleton (not in UniGene) with exon 254 

338686 CH22_EMAC005500.GENSCAN.472-5 253 

331977 AA465207 Hs.125887 ESTs 233 

314687 M79114 Hs.135177 ESTs 253 

336089 CH22_FGENES.688_18 253 

338952 CH22JU32I10.GENSCAN.23-22 233 

334612 CH2^FGENES.411_11 253 

338223 CH2^EMAC005500.GENSCAN250-10 253 

327845 CH.05_hsgi|6531962 252 

308187 AI538108 Hs.1561 10 Immunoglobulin kappa variable 1W 252 

317767 AW294164 Hs.128340 ESTs; Weakly similar to Cdc42 GTPase-act 252 

330468 L10343 Hs.112341 protease Inhibttor 3; sWivdenVed (SKAL 2.32 

319003 R17712 EST duster (not in UnlGene) 252 

323022 A1056733 Hs.133865 ESTs 252 
303148 R73167 Hs.127317 ESTs; Weakly similar to CYTOCHROME P450 

303215 AW250314 EST 232 

318891 H10477 Hs.1 96208 ESTs; WeaWy similar to !l!l ALU SUBFAMI 252 

336653 CH22_FGENES.33-4 ' 252 
333329 CH22LFGENES.138J2 252 
301980 U69962 Hs.121498 poteium voltage-gated channel; Shatne 251 
336968 CH22.FGENES575-28 251 
308539 A1694191 EST singleton (not in UniGene) with exon 251 
326417 CH.19Jisgi!5867362 251 
328851 CH.07.hsgiJ6381923 251 
329254 CHXJisgil5868733 251 
303075 W88779 Hs.59125 ESTs 25 
335131 CH22LFGENES.497J5 25 
303129 AA308334 Hs.172210 MUF1 protern 25 
327067 CR21jBgl|6531965 25 
324064 AW137650 EST duster (not in UniGene) 25 
325965 CH.16>gi|5867147 25 
334525 CH22JGENES.402L4 23 

336654 CH22LFGENES54-2 25 
302348 AF100779 Hs.194680 WNT1 inducible signaling pathway protein 25 
309275 AI989570 EST singleton (not in UnlGene) with exon 23 
329246 CHJChsgiI5868732 Z3 
305557 AA774834 EST singleton (not In UnlGene) with exon 23 
322907 AA084941 EST duster (not in UniGene) 25 
318683 AI703241 Hs.202653 ESTs; Weakly similar to Xin [M.muscutus] 229 
309233 A1971416 EST singleton (not in UnlGene) with exon 2.29 
308913 AI860692 Hs. 119122 ribosomal protein L13a 229 
335827 CHZLFGENES.620 1 22& 
334066 CH2SJGENES527J1 2.29 
302656 AW293005 Hs220S05 ESTs 229 
308974 AI872290 Hs.140 immunoglobulin gamma 3 (Gm marker) 229 
333607 CH22.FGENES.216J 129 



MB-M3A453, MB-MDA-231 , LnCap 

NCW345, RPWE-2, PRSC.con 

MCF7, MB-MDA-453, M&MDA-231 

EB,PC3,CALU6 

OVCA-R, MB4tfDA453,A549 

HT29,OVCA-K,Caco2 

HT29, MCF7,MB-MDA435s 

PC3,NCI-H460,DU145 

OVCA-R, A549, MB-MDA-453 

NCI-H345,NCI-H69, LnCap 

MCF7,NCI-H345 t DU145 

A549,EB, PC3 

NCi-H69»HT29,EB 

RPWE-2, PRSCJog, PRSC.con 

NCWM5, RPWE-2, PRSC.con 

HT29.MCF7.M&WDA-231 

NCW23.A549, CALU6 

PRSC.con, RPWE-2, PRSCJog 

NC1-H345, PRSC.con, PRSCJog 

HT29, BT474, MCF7 

H729, DU145.EB 

NCW69, PRSC.con, NCI-H345 

PRSC.con, PRSG_tog, RPWE-2 

A549.MCF7, OVCA-R 

PRSCjon, NCW345, PRSCJog 

PC3, MCF7, LnCap 

HT29, NCI-H460, MB-MDA435s 

PRSCJog, PRSC.con, RPWE-2 

CALU6, MB-MDA-231, DU145 

PRSC.con. RPWE-2, PRSCJog 

NOH23,NCI-H520,NCI-H460 

BT474, MB-MDA-231. MB-MDA-453 

OVCA-R, A549, MB-MDA-435S 

NOH69, PRSC.con, NCI-H345 

PRSC.con, Caco2, PRSCJog 

PC3, OVCA-R, HT29 

OVCA-R, MB-MDA-453, EB 

DU145.MB-MDA453.MCF7 

OVCA-R, MB-MDA-453, PC3 

NCI-H69, NCI-358, PRSC.con 

BT474, CALU6, MB-MDA-231 

PC3,Caco2,HT29 

MCF7, PC3, MB-MDA-453 

CALU6, MB^DA-231, DU145 

252 NCI-H345, PRSC.con, RPWE-2 

NCI-H345, PRSC.con. PRSC tog 

NCW69,LjiCap,NC|.H345 

DU145, EB. LnCap 

DU145,BT474,MB4/DA-231 

NCI-H345, MB-MDA-231, LnCap 

HT29,BT474,EB 

NCW345, NCI-H69, PRSCJog 

HT29.MCF7.BT474 

NCm520, NCJ-H460, NCWH23 

RPWE-2, NCI-H345, PRSC.con 

DU145, OVCA-R, EB 

NOH69.NCI-H345, PRSCJog 

LnCap, DU145.HT29 . 

NCI-H345, NCI-H69, MB-MDA435S 

DU145, HT29, EB 

NCI-H69, NC1-H345, RPWE-2 

NCI-H345, PRSC.con, NCI-H69 

BT474.PC3, MB-MDA-453 

LnCap, CALU6, DU145 

NCI-H460, NC1-H23.NCI-H520 

NCJ+i69,NCW345,PRSC tog 

CALU6,CALU6,MCF7 

MB-MDA-231, CALU6.EB 

NCI-H345, PRSC.con, RPWE-2 

CALU6, OVCA-R, EB 

MB-MDA435s.MCF7.Ht29 

PRSC.con, PRSCJog, RPWE-2 

PRSCcon, PRSCJog, NCW345 

NOH23,Caco2,CALU6 

CALU6,A549,NCI-H69 

OVCA-R, MCF7,A549 
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335174 CH2i.FGENES.aMJ 229 

332028 AA4B9680 Hs.134405 ESTs; Weakly similar to DimlphomologlH 129 

335417 CH22_FGENES.823_39 259 

323426 AA251401 EST duster (not In UniGene) 129 

336618 CH2LFGENES5-1 129 

310017 AI188739 Hs.148488 ESTs 129 

334055 CH2a/GENES.327„6 258 

337168 CH22_R3ENES.562-2fl 128 

329824 CH.Kp2g(|6530758 128 

333B91 CH2£_FGENES592.13 258 

339127 CH2aj5A59H18.GENSCAN55-1 128 

305686 AA812726 EST singleton (not in UniGene) with ©con 128 

329782 CH.14j»2gl|5912597 258 

311059 AI6100O1 Hs.175346 ESTs 128 

336934 CH2a.FGENES.35M 128 

314893 AA761093 EST duster (not in UniGene) 258 

331596 N72574 Hs.50220 ESTs 258 

330729 AA258559 Hs.3736 ESTs; WeaWy similar to DELTA-UKE PROTE 228 

338285 CH2^EM:AC005500.GENSCAN.29W 127 

300154 AI245127 Hs.179331 ESTs 257 

306383 AA969078 Hs. 183698 ribosomal protein 159 257 

309005 AI884454 EST singleton (not in UniGene) with exon 257 

332995 CH22.FGENES.58J 257 

337426 CH22_FGENES.761-3 257 

337778 CH2LEM-JVC000097.GENSCAN.119^0 127 

329705 CH.14jj2gi|6065790 127 

335971 CH2^FGENES.65a.4 257 

315862 AI075846 Hs.133998 ESTs 127 

316466 AI911204 Hs.126365 ESTs 257 

334430 CH22_FGENES.385 3 257 

331941 AA452257 Hs.99272 ESTs 256 

301230 AW269804 Hs, 153019 ESTs 126 

317394 AI935024 Hs.190518 ESTs 256 

306220 AA928363 EST singleton (not in UniGene) with exon 256 

304134 H54627 EST singleton (not In UniGene) with exon 256 

335421 CH22.FGENES-551.1 256 

305260 AA679280 Hs.156110 Immunoglobufin kappa variable 1[>8 256 

303592 AA421129 EST 126 

317982 AI004985 Hs.130607 ESTs 256 

325304 CH.11Jtsgl|5886910 256 

334118 CH2JLFGENES.330_.19 126 

335687 CH22_FGENES^96^2 256 

334035 CH22_FGENES.322_3 256 

305454 AA738413 EST singleton (not in UniGene) with exon 125 

335902 CH22.FGENES.635J0 125 

339215 CH22_F113D11.GENSCAN.6-10 125 

328810 CR07JS gi|5868327 125 

337396 CH22.FGENES.749-1 255 

336808 CH22.FGENES505.3 125 

305808 AA853958 EST singleton (not In UniGene) with exon 124 

333571 CH22J=GENES.188J 254 

323023 AA225188 Hs558539 ESTs 254 

334626 CH2^FGENES.416J 254 

333593 CH22_FGENES510_2 124 

326708 CH.20_hsgP67593 254 

314502 A1041717 Hs.132141 ESTs 253 

309181 AI9517Z7 EST singleton (not in UniGene) with exon 253 

324926 H56196 Hs.1 17798 ESTs 123 

333532 CH22JGENES527J 253 

328243 CR06jTsgif6056292 253 

327037 CH51JisgI[6531965 253 

307380 A1222985 EST singleton (not in UniGene) with exon 253 

334766 CH22_FGENES.428.15 253 

335236 CH22JGENES.515_8 253 

336615 CH2^FGENES.613.5 123 

307558 AI281998 EST singleton (not in UniGene) with exon 123 

308029 A1457115 Hs.62954 ferritin; heavy polypeptide 1 253 

331508 N47559 Hs.46732 EST 253 

320980 AJ237672 Hs514142 5;Kknethyienetetrahydrafolate reductase 253 

304241 AA010976 EST singleton (not in UniGene) with exon 123 

314682 A1190864 Hs.178226 ESTs; Weakly similar to Ml ALU SUBFAMI 123 

308382 AI624301 EST singleton (not in UniGene) with exon 252 

314476 AW207857 Hs.169604 ESTs 2.22 

327864 CHJ)6_hsgi(5868130 122 

337279 CH22.FGENES.665-2 252 

302263 AA325517 EST 252 



HT29, A549, MB-MDA-453 
EB.A549.DU145 
NCI-H69.NCI-H345, PRSCJog 
KT29,MB-MDA-231,BT474 
NCU58,NCI-H460,NCI-H69 
NCI-H345, PRSCJog, PRSC.con 
DU145, OVCA-R, MBM3A-453 
NCI-H69, PRSCJog, NCW345 
NCW23,CALU6, RPWE-2 
NCI-H69. MB-MD/^231, RPWE-2 
PRSC.con, NCW345, RPWE-2 
Na-H520,NCW23,NCI-H460 
NCI-H69, NCI-H345, PRSCJog 
MCF7,BT474,M&MDA-435s 
BT474,HT29,MB-MDA435s 
OVCA-R, HT29.DU145 
A549,MCF7,NCW58 
M&M>A-231,CALU6,MCF7 
NCkr^PRSCJco,, PRSC.con 
NCI-H23,NCI-H520,Na358 
RPWE-2, NCI-H345, PRSC_log 
A549.MCF7.BT474 
RPWE-2, NCI-H345, PRSCJog 
DU145,EB,CALU6 
NCI-H69, PRSC.con, PRSCJog 
PRSC.con, PRSCJog, RPWE-2 
PRSCJog, MB-MDA-231, NCI-H23 
HT29,MB4V!DM35s, OVCA-R 
NCI-H460 1 NCI458 I BT474 
NO-H345.NCW69, PRSC.con 
PRSC.con, LnCap, PRSCJog 
NCI-H345, PRSCJog, NCI-H520 
NCI-H345, PRSC.con, PRSCJog 
NCW345, PRSC.con, PRSCJog 
DU145, CALU6, PC3 
NC1-H69, PRSC.con, PRSCtog 
NCW345.NCW69, PRSC.con 
CALU6,OVCA*.DU145 
PC3.MB4UDA435S.A549 
MCF7, CALU6, A549 
PRSC.con, NCi-H69, PRSC.tog 
A549,CALU6,LnCap 
NCI-H345, PRSC.con, RPWE-2 
EB, HT29, CALU6 
EB,DU145,HT29 
PRSC.con, PRSCtog, RPWE-2 
PC3,OVCA*,MB4WDA-453 
EB,A549,DU145 
NCI-H345,NCI-H69, PRSC.con 
M*MDA-453,DU145.EB 
MOT, MB-MDA-453, PC3 
EB, DU145, CALU5 
NCI-H69,NCi-H345, PRSCJog 
NCI-H69, NC1-H345, PRSC.con 
NCRW60,NCW23,NCI-H520 
NCI-H345, RPWE-2, PRSC.con 
PRSC.con, PC3, MB-MDA-231 
EB, EB, DU145 
CALU6.CAW6, MB-MDA-453 
PC3, LnCap, LnCap 
LnCap, DU145.EB 
NCI-H345, PRSC.con, PRSCJog 
PRSCJog, NCW345, RPWE-2 
OVCA-R, MCF7, BT474 
NCLH69, PRSCtog, PRSC.con 
DU145, OVCA-R, CALU6 
EB, OVCA-R, MBM>A-453 
MB-MDA453,MCF7 t BT474 
OVCA-R, EB, EB 

BT474, MB-MDA-435S, MB-MDA-231 

MB-MDA-231, MCF7, OVCA-R 

OVCA-R, BT474, CALU6 

DU145,EB,A549 

NC1-H69, PRSCJog, PRSC.con 

NCI-H345, NCi-H69, PRSC.con 

BT474,NCI-H520,DU145 
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322840 AA083710 EST duster (not In UnlGene) 122 

307574 A1283549 EST stngteton (not In UnlGene) with exon 222 

319027 AA716612 EST duster (not In UnlGene) 222 

305925 AAB778B3 EST singleton (not In UniGene) wtth exon 222 

33725 CH.14_p2 gl|5065785 222 

316194 AW298529 Hs255774 ESTs 222 

301119 AF142579 EST 222 

333815 CH22_FGENES282_4 222 

334358 CH22JGENES.378J 222 

303763 AF043250 Hs.3C928 DNA segment on chromosome 19 (unique) 1 1 22\ 

335593 CH22_FGENES.581_32 2.21 

334026 CH22_FGENES.31B_3 121 

322224 AF086064 EST duster (not in UniGene) 221 

309836 AW295497 Hs.1 57397 ESTs 221 
332669 M33374 Hs.661 NADH dehydrogenase (ubiquinone) 1 beta s 221 

307629 AI30Q246 EST singleton (not m UniGene) with exon 221 

300470 T87841 EST 221 

330064 CH.19j)2gi|6165044 221 

338819 CH22_DJ246D7.GENSCAN.1-24 221 

337797 CH22_EfAAC005500.GENSCAN.3-4 221 

328025 CH.08Jl3gij5902482 12 

326240 CH.17jisgil5867260 22 

312865 AW005376 Hs.173280 ESTs 22 

338450 CH22_EA1AC005500.GENSCAN35946 22 

302532 U60181 Hs248115 growth hormone secretagogue receptor 22 

321132 AA081495 EST duster (not in UnlGene) 22 

337787 CH22_EMAC000097.GENSCAN.12W 22 

337032 CH22.FGENES.43&3 22 

300026 M11507 AFFX controt transferrin receptor 2.2 

333139 CH22_FGENES.83J6 21 

334298 CH22_FGENES372J 22 

335002 CH22J=GENES.470_7 2.2 

335000 CH22_fGENES.470_5 2.2 

337298 CH22_FGENES.678-3 22 

302461 AF104253 Hs241381 cofador required for Sp1 transcriptiona 22 

334819 CH22_FGENEa436J5 2.19 

300426 AW452660 Hs.253296 ESTs 2.19 

302569 AC004472 muifeple UniGene natehes 119 

339401 CH22.BA232E17.GENSCAN.7-7 2.19 

328791 CH.07Jsgi|5868309 2.19 

337333 CH22_FGENES.711-3 2.19 

339363 CH22_BA354I12.GENSCAN.336 2.19 

329429 CH.Y_hsgI|5868882 2.19 

336927 CH22_FGENES.348-3 2.19 

336351 CH22_FGENES.616_3 119 

313466 AA004731 Hs.148876 ESTs 2.19 

307433 A1244895 EST smgleton (not In UniGene) with exon 2.19 

336590 CH22.FGENES.51_2 2.19 

310758 AI770001 Hs.209445 ESTs 2.18 

327823 CR05Jsgi|5867968 2.18 

313257 N92638 EST cluster (not in UniGene) 2.18 

335377 CH22_FGENES.543J7 2.18 

303958 AUM2931 EST singleton (not in UniGene) with exon 118 

320153 AF064594 Hs.120360 phosphofipase A2; group VI 118 

335201 CH22J^ENES.508J0 118 

338591 CH2^EMAC005500.GENSCAN.4344 2.18 

331958 AA455960 Hs.99405 ESTs 118 

337218 CH2JLFGENES.614-2 118 

309470 AW118833 EST singleton (not in UniGene) with exon 118 

331896 AA435495 Hs.97174 H sapiens mRNA; cDNA DKFZp566E164 (from 

330275 CH.05_p2gil6671904 118 

335817 CH22J r GENES.618.5 2.18 

332896 CH22_FGENES.35_10 118 

303294 AA205300 EST . 117 

336703 CH22_EMAC005500.GENSCAN.480-2 117 

300115 AI215044 Hs208130 ESTs 117 

330979 H22466 Hs.31795 ESTs 117 

317246 AW105092 Hs.155690 ESTs 117 

329078 CHJ_hsgi|5868597 117 

312554 AI222630 Hs.109390 ESTs 117 

323207 AI052795 Hs.192201 ESTs 117 

301894 AM84435 Hs.41997 alpha-1* glycoprotein 117 

329097 ChXhsgi|5868624 116 

328328 CH.07_hsgij5868375 2.16 

302671 AA522440 Hs.135917 ESTs 116 

329201 CH.X_hsgi|5868718 116 



HT29 ( MWWDA-453 t CALU6 

OVCA-R CALU6,BT474 

LnCap, NCI-H69.NCW69 

NCI-H345, NCI-H69. NCJ-H69 

NC!-H69,PRSC_con,NCI-H345 

CALU6,EB t NCI-H520 

A549, OVCA-R, EB 

MB-MDA435S, EB, MB-MDA-453 

NCi-H345,RPWE-2,PRSC.con 

Cacol NCI-H23, NCI-H520 

NCW345, PRSCJog, RPWE-2 

NCW69,PRSC_con,NCI-H345 

PRSC.con, PRSCJog, RPWE-2 ( 

NCI-H345, PRSC.con. RPWE-2 

NCi-H520,CALU6,OVCA* 

MB-MDA-231, M&MDA453, HT29 

PC3,EB,CALU6 

NCJ-H69,PRSC.con f BT474 

NCI-H69, RPWE-2, PRSCJog 

LnCap,NCm69,NCI-H520 

RPWE-2, PRSC_con, PRSCJog 

EB, LnCap, M&M)A453 

DU145.DU145, OVCA-R 

MCF7, MB-MDA-453, MB-MDA-435S 

PRSC con, PRSC_tag, PRSCJog 

NCI-H23, NCI-H520, NCI-358 

EB, PC3, LnCap 

NCI-H69, NCI-H345, RPWE-2 

HT29,EB,MB^!DA-231 

HT29 t MB4ADA453,Caco2 

PRSC.con, PRSCJog, RPWE-2 

PRSC.con,NCI-H345,NCI-H345 

EB, PC3,A549 

NCm69,A549,HT29 

EB, CALU6, LnCap 

CALU6,BT474,Caco2 

DU145,CALU6,HT29 

RPV\E4, PRSCJog, PRSC.con 

NCI-H345.NCI-H69, PRSCJog 

DU145.PC3.HT29 ' 

NCI-H69, NCI-H345, PRSCJog 

NOH69, PRSC_log, PRSC_con 

CALU6,HT29, OVCA-R 

NCI-H69, PRSCJog, NCK358 

DU145,EB ( m©A-231 

CALU6, DU145, OVCA-R 

NCI-H23,NCt-H23,NCI-35a 

PRSC_con,NCI-H69, PRSCJog 

EB, MB-MDA-231, BT474 

PRSC_COn, NCi-H69, NCI-H345 

PRSCJog, RPWE-2, PRSC_con 

PC3, M&M)A^35s, CALU6 

NOH345, RPWE-2, PRSC_con 

LnCap, PC3,MB^WDA-435s 

OVCA-R, DU145, HT29 

NCI-H69, NCI-H345, RPWE-2 

MCF7,NCl-H23,NCI-H460 

CALU6 f A549,MCF7 

PC3,EB,MB^DA435s 

118 RPWE-2, NCI-H69, PRSCJog 

NO-H345, PRSC.tog, PRSC.con 

A549,Caco2,PC3 

NCI-H345, RPWE-2. PRSC.tog 

MB4\/!DA435s,A549,MCF7 

HT29,BT474,NCI-H69 

PC3, OVCA-R, HT29 

MCF7,EB,MB^DA-435s 

MBMDA-453.DU145.EB 

MB-MDA-453, MB-MDA-231, BT474 

NCW520, OVCA-R, MCF7 

NCI-H69,NCi-H345, PRSCJog 

PRSC_con, LnCap, PRSC.tog 

MB4WDA-231,MCF7,NCU58 

NCI-H345, PRSC_con, NCI-H69 

BT474,DU145,A549 

OVCA-R, PC3, MB-MDA-435S 
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329902 CH.15_p2gP34760 2.16 

334435 CH22JGENESJ85J0 2.16 

330742 AA400979 Hs.25691 caldtonin receptor-like receptoracfivi 116 

328464 CH.07J»gI|5868454 2.16 

334784 CH22J=GENES.43^9 116 

337771 CH2?JMAC000097.6ENSGAN.119.10 116 

300181 AI284955 Hs.157568 ESTs; Weakly similar to ataxin-2 [M.rnusc 2.16 

300268 AI539446 Hs.245450 ESTs 2.16 
309575 AW168096 Hs.195188 glyceraldehyde^phosphate dehydrogenase 2.16 

336548 CH22.FGENES.841 5 2.16 

328506 CH.07_hsgP68471 2.16 

330169 CH.05jj2gI[6165182 2.16 

305480 AA746500 Hs.25911 HU-B associated transcript! 2.16 

302270 R56151 EST 116 

306669 AJ004899 . EST singleton (not in UnlGene) with axon 2.16 

33887 CH.16_hsgi|5B67087 2.16 

327015 CH.21 Jis gi|5867664 2.15 

338576 CH22_EMAC005500.GENSCAN.429-1 2.15 

333592 CH22J : GENES.209J2 2.15 

317253 AW071241 Hs.199685 ESTs 115 

302301 R67493 Hs.127150 ESTs; Weakly similar to ZINC FINGER PROT 2.15 

336858 CH22J=GENES.2934 2.15 

308417 AI640693 Hs.2186 eukajyofic translation elongation factor 115 

338177 CH2^EMAC005500.GENSCAN^1M 2.15 

337592 CH22_C2QH12.GENSCAN.6-7 2.15 

325945 CH.16Jegil5867138 2.15 

335262 CH2^FGENES.520_3 2.15 

333665 CH22_FGENES^44_1 2.15 

333710 CH22_FGENES.250J25 2.14 

304927 AA604728 Hs. 1951 88 gfyceraldehyde^hosphate dehydrogenase 114 

336939 CH2Z.FGENES.417.20 2.14 

313283 W32480 Hs.157099 ESTs 2.14 

306221 AA928686 EST singleton (not in UnlGene) with exon 2.14 

333205 CH22_FGENES.10^5 2.14 

312932 AI804218 Hs.209614 ESTs 114 

328938 CH.08_hsgII5868500 2.14 

326746 CH.20Jisgil5867611 2.14 

337964 CH22_EW^C0055DO.GENSCAN.100-9 2.14 

337984 CH22>EMAC005500.GENSCAN.11(W 114 

337704 CH2LEMACXW0097.GENSCAN.87-6 114 

302162 AF119046 EST 114 

303192 AA081755 Hs.8059 ESTs; Highly sim3ar to SYNAPTOTAGM1N IV 114 

306200 AA926816 EST singleton (not In UniGene) with exon 114 

303996 AW515979 Hs.84298 CD74 antigen (invariant pdypptd of majo 114 

325409 CH.12Jsgl}5866921 114 

308558 A1700145 Hs.172182 poWA)-btnding protein; cytoplasmic 1 2.14 

302185 AA243837 Hs.156915 ESTs 2.14 

303021 W39612 EST 114 

301005 AW451916 Hs.210848 ESTs 114 

336029 CH22LFGENES.672.4 114 

305443 AA736653 EST stogleton (not bi UnlGene) witii exon 114 

335485 CH2Z.FGENESJ70J7 113 

304817 AA584712 EST singleton (not in UniGene) with exon 113 

309859 AW298760 EST singleton (not In UnlGene) with exon 2.13 

326206 CH.17.hsglp867219 113 

303656 AA437189 Hs.122574 ESTs 113 

334745 CH22.FGENES.426J 113 

318504 T26453 EST dieter (not in UniGene) 113 

306839 AI077385 EST singleton (not In UnlGene) with exon 2.13 

303843 W94322 Hs.58094 melanoma Inhibitory activity 2.13 

308444 AI659398 Hs.197097 EST 113 

301322 AW448965 Hs.256305 ESTs 113 

326997 CR21_hs #867660 113 

326793 CR20_hsgi{5857631 113 

320360 H12405 EST cluster (not in UniGene) 2.12 

316301 AW208279 Hs.192009 ESTs 112 

335371 CH22_FGENES.543J 2.12 

301178 AA828385 EST 112 

326136 CH.17Jsgip867202 112 

339213 CH22LFF113D11.GENSCAN.64 112 

335980 CH2*_FGENES.653J 2.12 

314380 AA758797 Hs.192807 ESTs 111 

306779 AI041302 EST singleton (not in UniGene) with exon 111 ' 

335774 CH2?_FGENES.607JO 111 

334914 CH22.FGENES.457J • 111 

304619 AA515554 Hs. 11 9598 ribosomal protein L3 111 



PRSCj»n,NCl-H69,NC|.H345 

PRSC_con,NCl-H345, RPWE-2 

MCF7,MB-MDA453,PC3 

NCI-H69, PRSCJog, NCW345 

PRSCJog, RPWE-2, PRSCjxm 

NCUWRSC.con, RPWE-2 

DU145,EB,CAUJ6 

PRSC.con, RPWE-2, PRSCJog 

A549,NCI-H23,MB-MDA.453 

NCI-H345.NCI-H69, MB-MDA-231 

EB, A549, CALU6 

NCI-H460, MCF7, MB-MDA-453 

EB,DU145,NCI-358 

OVCA-R, MB-MDA-435S, PRSCjxm 

PRSCJog, PRSC.con, NCI-H345 

EB,CALU6,NCI-35ft 

EB, PC3.HT29 

Na-H69,NCl-H345, PRSC.con 
NCI-H69, OVCA-R, PRSC.con 
MB-MDA435S, NCI-H23, MB-TOU53 
PC3,MCF7,MB-MDA435s 
RPWE-2, PRSC_con, NCI-H69 
EB,OVCA*CALU6 
NCW345.NCW23.NCI-H520 
PC3.A549.HT29 

MB-MDA453, MB-MDA435S, DU145 
EB,PC3,A549 

PRSC.con, RPWE-2, PRSCJog 

PRSCJcfl,NCM69, PRSC.con 

LnCap,PC3,MCF7 

NCl-K69,NCI-H345, PRSC.con 

EB.MB4UDA-231.A549 

NCi-H460,PRSC.a>n > NCI-H23 

NCI-H69, PRSC.con, PRSCJog 

PRSCjoon, NCI-H345, RPWE-2 

HT29.PC3.M&MDA-453 

NCI-H345.NCI-H69, PRSC.con 

RPWE-2, PRSCLcon, PRSCJog 

EB.DU145.NCW345 

NC(-H69,NCi.H460,NCI-358 

MB-MDA435s.PC3.EB 

MB-MDA435S, MB-MDA-435S, MB-MDA-453 

MB-MDA453, CALU6, DU145 

LnCap. MB-MDA-231, BT474 

PRSCJog, PRSC.con, RPWE-2 

MCF7.E&M&MDA435S 

MB4/IDA453,MCF7,EB 

PRSC_con, NCI-H69, RPWE-2 

DU145.EB.HT29 

NC1-H69, PRSC.con, RPWE4 

NCW58,r4CW620,NCI-H23 

NCI-H460,MB-MDA435s,MCF7 

MCF7.MCF7.NCI-H520 

NCJ-H69. PRSC.con, LnCap 

EB, MB-MDA-231, LnCap 

LnCap, MB-MDA-435S.EB 

OVCA-R DU145, MB-MDA-453 

RPWE-2, UCap,CALU6 

MCF7, MB-MDM53, MB-MDA-435S 

MB-MDA435S, NCI-H345, RPWE-2 

MB-MDA453,MCF7,BT474 

NCI-H345, LnCap, PC3 

HT29,A549,CALU6 

PRSCJog, PRSC.con, MB-MDA-453 

MB-MDA-231, BT474.HT29 

DU145,DU145,EB 

PC3,MB-MDA435s,DU145 

EB, OVCA-R, LnCap 

RPWE-2, PRSCJog, PRSCLcon 

OVCA-R, PC3.MB4/IDA-231 

BT474,BT474,OVCA-R 

PRSC con, PRSCJog, RPWE-2 

NCI-H345, PRSC.con, PRSCJog . 

PC3,A549,MB-MDA-453 

PRSC con, NCI-H345, NCI-H69 

EB,MB-MDA453,MB-MDA-435s 
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oUaJoo AI199714 


Um 4CQ440 rpT. 

HS.150149 ESTS 


0 44 
2.11 


CALUo, QVGA-K, DU145 


TDCCCO A A 0*14741 

oUoooo AA994743 


EST singleton (not in UruGene) witn exon 


O 44 

2.11 


HT29, MB-MDA-453, CALUo 


447704 


CnZ^cMAUftJlAJy7.ecN5UAN .1/1 -o 


O 44 

2.11 


PRSCjog, PR5C_.con, rpwc-2 


(MO 140 


CHZ?_rGcNcS.o4_1 


O 44 
2.11 


HT29, NUmo9, OVCA-R 


•JlOUol AI24/1J4 


MS. 1 00^01 coTS 


1 44 

2.11 


lin ftfnA4CO MPC7 UTOO 


0U23O0 AA44o44 1 


U- 41QQj<1 CGTr> 

ns.1Joo42 cols 


1 44 
2.11 


m/m oeo mm loo pai i ic 
NCWoo, NCI-H23, CALUo 


*12iJo Nw955 


COT 

col 


n 44 

2.11 


PRSCJog, PRSCjcon, NCI-n345 




ns.ioouoo dcNc protein 


O 44 

2.11 


A549, rUo, KT29 


OOO 4 CO A A cecooo 

422152 AA565332 


EST duster (not in UniGene) 


2.11 


A549, CALUo, to 


326418 


CH.19_hs gl|5867365 


2.1 


co, OVCA-R, DU145 


308709 A1783498 


Hs.181165 eukaiyotic translation elongation factor 


2.1 


kin kink ii)c« kin kmA icn ni ijir 

MB-MDA-435S, MB-MDA-453, DU145 


332737 C01852 


Hs.84359 hypoftetical protein 


2.1 


NCI-H23, A549, DU145 


333283 


CH22.FGENES.128J3 


2.1 


NCW1345, RPWE-2, PRSC_Con 


328636 


CH.07_hsgiI6004473 


2.1 


DU145, EB, MB-MDA-453 


329187 


CH.X_hsg[|5868713 


1 4 

2.1 


NCl-358, NCI-H23, NCI-H460 


305999 AA889603 


EST singleton (not in UniGene) with exon 


2.1 


HT29, OVCA-R, PC3 


333220 


CH22.FGENES.104J2 


2.1 


PRSC_con, PRSCJog, RPWE-2 


335092 


CH22_FGEHES.492J 


2.1 


. IAI > 1 HA nnOA LM| (MIC 

NCI-H69, PRSC_con, NCI-H345 


304887 AA599355 


EST singleton (not in UniGene) with exon 


n 4 

2.1 


DU145, EB, MCP7 


325359 


CH.12Jusgi|5866920 


2.1 


IID tmA A CO CD kfD kin* AtCt 

Md-MDA-453, cd, MB-MDA-435S 


330956 H08730 


Hs.6933 ESTs 


Z1 


NCI-H520, PRSCjan, NCI-H345 


323786 AW449315 Hs.165795 ESTs 


2.1 


OVCA-R, A549,LnCap 


333619 


CH22J=GENES.219J 


2.1 


BT474, OVCA-R, HT29 


324538 AW502979 


EST cluster (not in UniGene) 


2.09 


ah i Apia niuir 

CALU6, A549, DU145 


303405 AA308601 


EST 


2.09 


m U jp aim ic iiAi i tr+r\ 

DU145, CALU6, NCI-H69 


328570 


CH.07Jsgi|5868231 


2.09 


LnCap, MB-MDA-231, DU145 


308971 A1871218 


Hs.224731 EST 


2.09 


NCI-H23, NCI-H460, NCl-358 


330467 K02268 


Hs.22584 prodynorphin 


2.09 


PC3, BT474, MB-M3M53 


334793 


CH22_FGENES.433J 


2.09 


EB, DU145, LnCap 


300908 AA618335 Hs.146137 ESTs; WeaMy similar to putative [C.e!eg 


2.09 


NCI-H345, PRSC_log t PRSCjcon 


309656 AW197060 Hs.195188 glyceraldehyde^phosphate dehydrogenase 


2.09 


A549, NCI-H23, NCI-H460 


320963 AB029041 H&209646 K1AA1118 protein 


2.09 


PRSC.con, PRSCJog, NCW345 


310833 AW295351 Hs.169136 ESTs 


2.09 


PC3, LnCap, MB-MDA-453 


335693 


CH22_FGENES^96J 


2.09 


NCI-H69, LnCap, PRSCJog 


325966 


CH.16Jisgit5867147 


2.09 


MCF7, CAUJ6, MB-MDA-453 


329319 


CriXJisgi|6381976 


2.09 


NCI-H460, EB, DU145 


338526 


CH2^AC005500.GENSCAN^96-14 


2.09 


NCI-H69, NCI-H345, PRSCJog 


336751 


CH22_FGENES.128-5 


2.09 


NCI-H69, NCI-H345, PRSC_tog 


325510 


CH.12_hsgi|5866974 


2.09 


1 iPXMfc At M » pm Ail ii« 

HT29, OVCA-R, CALU6 


323553 AA292626 


Hs.122854 ESTs 


208 


NCI-H345, RPWE-2 NCl-358 


326343 


CH.17_hsgij6525295 


2.08 


EB, LnCap, DU145 


335470 


CH23JGENES.568J 


2.08 


NCRH69, PRSCjcon, PRSCJog 


320122 TO681 


Hs.187515 ESTs 


2.08 


MCF7, MB-MDA-453, BT474 


335320 


CH22LFGENES.534J 


2.08 


BT474, MB-MDA-231, HT29 


307120 AI184343 


EST singleton (not in UniGene) with exon 


2.08 


KT29, MCF7, PC3 


338080 


CH22_EMAC005500.GENSCAN.172-1 1 


208 


LnCap, PC3, HT29 


313113 AI056258 


Hs.122523 ESTs 


2.08 


linn ni 1*1 ir iff* fcifNA j rn 

MCF7, DU145, MB-MDA-453 


337685 


CH2^EM^C000097.GENSCAN.77-1 


2.08 


NCI-H69, NCWi345, PRSC_tog 


327461 


CHX)2_hsgP04455 


2.08 


NCKH23, BT474, NC1^358 


335895 


CH22LFGENES.635J 


2.08 


KT29, MB-MDA-231, NCI-H520 


303933 AW471472 


EST singleton (not in UniGene) with exon 


2.08 


MB-M>A-231, BT474, NCI-H345 


314803 AI935159 


Hs.166841 ESTs; Weakly sirnSar to MYOSIN UGHT CHA 






NOimiSCLE ISOZYMES [H^apiens] 


2.08 


PC3, A549, BT474 


302722 U53530 


EST 


2.08 


DU145, MB-MDA-435S, OVCA-R 


307703 AI318588 


EST singleton (not in UniGene) with exon 


2.08 


HT29, MB-MDA-435S, CALU6 


310558 AI334965 


Hs.176976 ESTs 


2.08 


A549. LnCap, PC3 


315276 AA860090 


EST duster (not in UniGene) 


2.08 


PC3, MCF7, OVCA-R 


306443 AA976950 


EST singleton (not in UniGene) with exon 


Z07 


OVCA-R, PC3,EB 


307961 AW21059 


EST singleton (not in UniGene) with exon 


2.07 


HT29, OVCA-R, CALU6 


329735 


CH.14j>2gP65780 


107 


EB, HT29, OVCA-R 


335193 


CH22.FGENES.507J 


2.07 


EB.A549.A549 


320347 R34423 


Hs.221535 ESTs 


2.07 


CALU6, A549, EB 


316153 AA724474 


Hs.147208 ESTs 


2.07 


MB-MDA-453, PC3, HT29 


300921 AW293224 


HSJ232165 ESTs 


2.07 


HT29, CALU6, CALU6 


319264 T65096 


EST cluster (not in UniGene) 


2.07 


MtHVIUA*400, JVlUr/ , WUJJO 


330204 


CH.05j>2gi|6013606 


2.07 


OVCA^DU145,EB 


317070 AJ142037 


Hs. 125379 ESTs 


207 


PRSC_con, NCI-H345, OVCA-R 


337645 


CH22_EM^C000097.GENSCAN.10-8 


2.07 


NCI-H345,PRSC^tog,NCl-H69 


312501 AW450490 Hs.132886 ESTs 


207 


NCI-H520, CALU6, MCF7 


335587 


CH22_FGENES.581J6 


2.07 


NCI-H69, NCW345, PRSCJog 


311482 AI917706 


Hs.129997 ESTs 


207 


NCI-H520, MCF7, M&WDA-435S 


302488 AF161441 


EST 


207 


EB, DU145, CALU6 


304692 AA554202 Hs.76067 heat shock 27W) protein 1 


207 


MCF7, MB-MDA-453, PC3 


325369 


CH.12_hsgi|5866920 


207 


DU145,DU145,MB^WDA-453 


306284 AA936835 


EST singleton (not in UniGene) with exon 


207 


BT474, MB-MDA-231, HT29 


337402 


CH2LFGENES.752-1 


207 


A549,BT474.DU145 
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327418 CH.02Jwgi|58S7750 


107 


MCF7, MB4i®A^53, M&MDA-435S 


317977 AI004775 Hs.205091 ESTs; Weakly similar to WW domain bhdln 


107 


BT474,MB-MDA453,PC3 


331870 AA428560 Hs.161845 EST 


2.07 


MB-A/DA-231, MB-MDA-435S, BT474 


300750 AA514805 Hs.105464 ESTs 


2.07 


HT29,BT474,BT474 


336657 CH2U=GENES.35-14 


2.07 


MB-MDA453, MCF7, NCW460 


336035 CH2Z.FGENES.678JB 


2J07 


NCl-H69 t PRSCjx]n, RPWE-2 


325320 CH.11 hsgi|5866870 


2.06 


NCI-H69, PRSCJog, PRSCjxm 


306053 AA905312 EST singleton (not In UniGene) with exon 


2.06 


HT29,OVCA^MB^A-231 


333175 CH22_.FGENES.95J 


2.06 


LnCap,HT29,DU145 


304491 AA437096 Hs.1 15502 EST 


2.06 


MB^M35s,CALU6,CALU6 


310632 AI697536 Hs.176991 ESTs 


2.06* 


NCI-H69, PRSCJog, NCI-H345 


338521 CH2^EM*C00550aGENSCAN.39W5 


2.06 


NCMH345, PRSC.bg, PRSCJog 


334900 CH2*_FGENES.45*J4 


2.06 


A549,CALU6,NCl-H69 


337451 CH2ZJ : GENES.774-2 


2.06 


PRSC.con, PRSCJog, RPWE-2 


308792 AI815153 Hs.195188 glyceraldehyde-3-phosphatB dehydrogenase 2.06 


DUt45,BT474,MB-MDA453 


336854 CH22_FGENESJ280-1 


106 


LnCap, EB, MB-MDA435s 


304485 AA434076 EST singleton (not in UniGene) with exon 


2.06 


MB4UDA-231,BT474,CALU6 


326458 CH.19Jisgi|5867400 


2.06 


EB, DU145. LnCap 


303506 AA340605 Hs.105887 ESTs 


2.06 


LnCap, MCF7.CALU6 


33362B CH22 F6ENES.226_2 


£06 


NCI-H520,NCI-358 I NCU358 


3tJ0763 AA190753 EST 


2.06 


NCI-H69,NCI-H345,PRSCjX)n 


334836 CH22LFGENES.439.6 


2.06 


NO-H345, PRSCjxm, RPWE-2 


335217 CH22_FGENES.512l3 


2.06 


PRSCJog, PRSCjan, NO-H69 


338970 CH22_DJ32l10.GENSCAN.26-3 


2.06 


A549, MB-MDA453, UCap 


334842 CH22LFGENES.439J1 


106 


DU145,HT29,CALU6 


309309 AWD0S428 Hs 232857 EST 


2.06 


EB.DU145.OVCA* 


332949 CH22LFGENES47 12 


2.06 


EB,DU145.0VCA* 


310530 AW369863 Hs 150150 ESTs 


2.06 


PRSCjxm, PRSCJog, RPWE-2 


329401 CKX hsai!6682544 


106 


NCI-H69, PRSCjOon, RPWE-2 


316893 AA837332 EST duster (not in UniGene) 


2.06 


OVCA-R, MCF7, M&M3A453 


325022 W95840 Hs.59745 NADH dehydrogenase (ubiquinone) flavopro 


2.06 


Caco2,NCI-358, OVCA-R 


328839 CR14j)2gl|6572062 


2.05 


MB-MDA-231, RPWE-2, CALU6 


306668 AI004890 EST singleton (not in UniGene) with exon 


2.05 


DU145,MBJtfDA-453,MCF7 


315604 AW137442 Hs.136965 ESTs 


2.05 


LnCap, EB, PC3 


318551 AI909951 Hs.239307 tyrosyVtRNA synthetase 


105 


NC1-H345, PRSC_con, RPWE-2 


339344 CH22.BA354I11GENSCAN.28-1 


2.05 


BT474,MBJVIDA-231,A549 


310621 AJ632098 Hs.198099 ESTs 


2.05 


NCI-H69, RPWE-2, MCF7 


327051 . CH.21> g^6531965 


105 


PRSC.con, NCW345, PRSCJog 


336827 CH22.FGENES.236-2 


2.05 


NCI-H345.A549, MB-MDA-231 


311846 AI078033 Hs.177170 ESTs; Moderately simitar to (ill ALU SUB 


105 


OVCA-R, DU145,CALU6 


335036 CH2^FGENES.475J4 


2.05 


NCI-H69 l PRSC„con l NCI-H345 


313100 N52880 Hs.122817 ESTs 


2.05 


RPWE-2, NCI-H345, PRSCJog 


301927 AF014459 Hs.113250 ratinoscrdsis (X-Qnked; juvenile) 1 


105 


MB-MDA-231, NCW345, PRSC.Con 


326070 CH.17_hsgiJ5867175 


2.05 


MB-MDA-435S, MB-MDA-231, BT474 


338514 CH2^.EMAC005500.GENSCAN.392-4 


2.05 


PRSC_oon, PRSCJog, RPWE-2 


328098 CH.05Jisgij5868O2O 


2.05 


DU145, CALU6, EB 


301102 AA679361 Hs.249487 ESTs 


2.05 


NCI-H460,PRSC_con,NC[-H23 


306193 AA923457 EST singleton (not in UniGene) with exon 


2.05 


NCI-H345, PRSC_con, RPWE-2 


317027 AA883808 HS.174148 ESTs 


105 


EB.DU145, CAUJ6 


336102 CH22_FGENES.693J 


2.04 


LnCap, NCW69. PRSCJog 


301372 AI239895 Hs. 130555 ESTs 


2.04 


PRSC.con, RPWE-2, PRSCJog 


333252 CH22_FGENES.116J 


2.04 


N0358,A549,HT29 


322516 AW372340 Hs.159717 ESTs 


2.04 


HT29,MB4IDA-231,BT474 


324148 AA393624 EST cluster (not tn UnKSene) 


2.04 


RPWE-2, PRSCjxin, MB-MDA-231 


338770 CH22_EMAC005500.GENSCAN.520-1 


104 


PRSCj»n,NCI-H69,NCI-H460 


314795 AI798611 Hs.157277 ESTs 


2.04 


EB, PC3,LnCap 


333004 CH22JGENES.60J 


2.04 


A549, NCW58,DU145 


302405 AW245825 Hs.211914 NADH dehydrogenase (ubiquinone) Fe-S pro 2.04 


NCI-H520,CAUJ6,Caco2 


323587 AI905527 Hs.141901 ESTs; Moderately simllartD i!!l ALU SUB 


104 


EB,A549,HT29 


300898 AI276278 Hs.157176 ESTs 


104 


PC3,MB4i/IDA453,BT474 


301506 AI149878 Hs.143519 ESTs; WeaWy similar to testicular tekti 


104 


NCI-H69, RPWE-2, NCI-H345 


325851 CH.16Jisgij5867067 


2.04 


MB-MDA-231 , HT29, EB 


323945 AI125604 Hs.155117 ESTs 


104 


MCF7.DU145.DU145 


303265 AW160951 EST 


104 


LnCap, OVCA-R, DU145 


334135 CH22J=GENES.336_2 


104 


PC3,A549,MB-MDA435s 


329793 CH.14j)2gl|6522661 


104 


DU145.CALU6.HT29 


332595 AA256431 Hs.3244 G protein pathway suppressor 2 


104 


A549, CALU6, NCI-H23 


316059 AW166388 Hs.250181 ESTs 


104 . 


MCF7,HT29,A549 


324104 AW246071 Hs.133122 ESTs 


2.04 


Caco2,A549,MCF7 


306801 A1052653 EST singleton (not In UniGene) with exon 


2.03 


EB, LnCap, PC3 


338096 CH22_EM:AC0055QO.GENSCAN.181-14 


103 


DU145, HT29, CALU6 


327544 CH.03Ji3giJ5867797 


103 


PRSC_con, NCI-H69, NCI-H345 


318813 F13195 EST cluster (not in UniGene) 


2.03 


PRSC.con, RPWE-2, PRSCJog 


325289 CH.11Jisgi|5866903 


103 


EB, OVCA-R, A549 


311099 T56361 Hs.1 82167 hemogtobtn; gamma A 


103 


HT29,BT474,EB 


316079 AA922213 Hs.121735 ESTs 


103 


LnCap, OVCA-R, EB 
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303533 AW151131 EST singleton (not in UnlGene) with exon 2.03 

338579 CH22.EMAC005500.GENSCAN.43W 2.03 

326549 CH.19Jisgl(5867307 2.03 

320012 A1628384 Hs.193745 ESTs 2.03 

334111 CH22J=GENES.330J0 2.03 

327123 CH.21_hsgiI6531971 2.03 

324568 AW502311 EST duster (not in UnlGene) 2.03 

306012 AA896989 EST singleton (not in UniGene) with exon 2.03 

303106 AA012877 EST 2.03 

302194 U52219 Hs.158329 G protein-coupled receptor 50 2.03 

326646 CH20JwgI15867562 2.03 

304060 T61454 EST singleton (not in UnlGene) with exon 2.03 

304667 AA535602 EST singleton (not in UniGene) with exon 2.03 

330514 M83652 Hs.53155 properdin P factor; complement 202 

310324 A1473273 Hs.159674 ESTs; Weakly simflarto GU/TAMATE (H.sap 2.02 

330327 CH.0B_p2gl|5919194 2.02 

308447 A1659965 EST singleton (not in UniGene) with exon £02 

307778 A1344972 Hs231496 EST 2.02 

319459 T87351 Hs.194121 ESTs 202 

300935 AA513644 H&222815 ESTs; Weakly similar to Wiskott-Aldrich 202 

314318 AL037405 Hs.176141 ESTs 202 

334779 CH22.FGENES.432J 202 

336994 CH22.FGENES.410-2 202 

334076 CH2^FGENES327J1 202 

318116 AW452865 Hs.132339 ESTs 202 

328783 CH.20J»giJ6525298 202 

336142 CH2^FGENES.705.4 202 

320913 AA663733 EST cluster (not in UniGene) 202 

301644 AW239364 EST 2.02 

300944 AW081072 Hs.164624 ESTs; Weakly similar to SGt-3 protein [ 2.01 

310080 AW13708B Hs.144857 ESTs 2.01 

311248 AIB63918 Hs. 195078 ESTs 2.01 

319207 R87679 EST duster (not in UniGene) 201 

334760 CH2*_FGENES.428_9 2.01 

338368 CH2^EMAC005500.GENSCAN.325-2 2.01 

317300 AI417007 Hs.166338 ESTs 2.01 

323699 AW178750 EST duster (not in UniGene) 2.01 

301366 AA907713 Hs.221667 ESTs 2.01 

333306 CH2£.FGENES.137_3 2.01 

328031 CH.06_hsgi|5902482 2.01 

301806 AA326007 Hs.12056 asialoglycoprotBin receptor 1 201 

300993 AA564930 Hs.191777 ESTs; Weakly similar to XAP-5-fike protB . 2.01 

320042 TB4520 EST duster (not in UniGene) 201 

331082 R17059 Hs.22100 ESTs 2.01 

308851 AI829820 EST singleton (not tn UnlGene) with exon 201 

301163 AA732066 EST 201 

304734 AA57642B EST singteton (not in UniGene) with exon 2.01 

334855 CH22_FGENES.442_6 2.01 

337121 CH2?J=GENES.519-1 2.01 

331838 AA412498 Hs.104778 ESTs 2.01 

339181 CH2^A59H18.GENSGAN.72-6 2.01 

327554 CH.03_hsgi{58B781l 201 

304108 R63932 Hs.28467 EST 2 

315036 AA534953 Hs. 163297 ESTs 2 

312777 W92809 Hs.138557 ESTs 2 

305888 AA868536 Hs.126145 EST 2 

323185 R52177 EST duster (not in UniGene) 2 

308681 AI761307 EST singleton (not in UniGene) with exon 2 

325755 CH.14 hsgi{6682474 2 

324376 AW499705 EST cluster (not in UnlGene) 2 



331890 AA432166 Hs.3577 succinate dehydrogenase complex; subunit 2 



MB-MDA-231,BT474,UCap 

NCI-H69,NC|.H345, RPWE-2 

NCkH69,Caco2 1 NCW345 

BT474.MB-MDA453.MCF7 

NOH69,MB-MDA-231,BT474 

NOrW5,NCI-H69, RPWE-2 

NCW345,NCI-H520,NCI-H460 

NCW69, PRSCJog, PRSCjan 

RPWE-2, OVCA-R, EB 

NCWH520, NCI-H23, PC3 

NCW460,OVCA*,HT29 

NCI-H345, PRSC.con, PRSCJog 

A549,DU145,EB 

NCm23,NCI-H460 ( NCW58 

Nd-H345,M&WDA-231,BT474 

Na-H345,NCI-H69, PRSCJog 

NCWH345, RPWE-2, PRSCJog 

NCI-H69,CALU6, OVCA-R 

NCi-H460,NCW58,NCm520 

DU145.EB, OVCA-R 

PRSCjxm, LnCap, PRSCJog 

EB.HT29.DU145 

NCI-H345,PRSCj»n,NCl-H69 

OVCA-R, CALU6.EB 

MB-MDA-231, NCW69, NCI-H345 

NCI-H69,PRSC_ccn, RPWE-2 

NC1-H69, PRSCJog, PRSC_con 

DU145,EB,CALU6 

PRSC.con, RPWE-2, PRSCJog 

RPWE-2, NCI-H69,NCI-H23 

PRSCjxin, NCI-H345, PRSC_log 

NCI-H345, NCI-H69, RPWE-2 

HT29.A549, NCW460 

NCI-358 t NCI-H69, PRSC^tog 

Nd-H23 1 NCI-H520 l NCm460 

NCW460,DU145,NOH23 

MCF7, MB-MDA-453, OVCA-R 

PRSC.con, NC1-H345, RPWE-2 

NOH69,NCI-H345,PRSCjxjn 

MB-MDA-231, NCW345, PRSCjxm 

MB-MDA-453, DU145, EB 

HT29,NCW23 1 NCW58 

PRSC_con ( NCkH345,NCWi69 

EB,DU145,M&MDA435s 

DU145.EB.PC3 

OVCA-R, PC3, MBMDA-435S 

LnCap, MB-MDA-453, DU145 

NCI-H345, RPWE-2, PRSCJog 

NCI-H69, NCI-H345. PRSCjxm 

BT474.BT474.MCF7 

NCI-H345,PRSCjJon,NCI-H69 

BT474.HT29.DU145 

BT474, OVCA-R, MCF7 

MB-MDA-435S, MB-MDA-453, LnCap 

PRSC.con, NCW345, M^MDA-231 

HT29,HT29,BT474 

EB.A549.BT474 

RPWE-2 PRSC.con, NCI-H345 

NCI-H345, PRSC.con, PRSCJog 

DU145.BT474.PC3 

CALU6,MB^DA453,A549 
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Table 4 



Pkey: Unique Eos pro beset identifier number 

ExAccm Exemplar Accession number, Genbank accession number 

UnigencID: Unigene number 

Unigene Title: Unigene gene title 



Ptey ExrJVccn UniGJD CompleteJ*ttle 


Ratio Met/BS 


Top 3 expressing cell lines 


313166 AI801098 Hs.151500 ESTs 


1223 


Caco2,EB, OVCA-R 


334593 CH22_FGENES.408_3 


8.06 


NCW69, OVCA-R, OVCA-R 


331084 R20655 Ks.81281 Human done 23732 mRNA; partial ods 


7.89 


LnCap, OVCA-R, EB 


324598 AA502659 Hs.163986 ESTs 


7.77 


OVCA-R, EB.CALU6 


314071 AA192455 Hs 188690 ESTs 


7.76 


CALU6, EB, DU145 


315178 AW362945 Hs.162459 ESTs 


6.81 


OVCA-R, EB.CALU6 


325519 CH.12JisgI|6017036 


634 


NOHK,NCW345,PRSC_CQn 


331433 H68Q97 Hs 161023 EST 


6.16 


OVCA-R, A549.EB 


315021 AA533447 EST duster (not In UniGene) 


6.15 


PC3,EB,CALU6 


337695 CH22JMAC000097.GENSCAN.84-1 


5.84 


NCI-H69,NCI-H345,DU145 


324046 AA378739 EST cluster (not m UnlGene) 


5.77 


OVCA-R, DU145.EB 


300781 AA731209 EST cluster (not In UniGene) with exon h 


5.72 


M&MDA-453, MCF7, MB-MDA-435S 


320701 AI093177 Hs.134923 ESTs 


5.68 


AM9,NCI-H345,NCW69 


332471 AA416967 Hs.120980 nuclear receptor co-represser 2 


5.68 


LnCap, A549, OVCA-R 


331858 AA421163 Hs.163648 ESTs 


5.66 


OVCA-R DU145,Caco2 


330987 H40988 Hs.131965 ESTS5.35 


NCW345.0Va-R,LnCap 


322309 AF086372 EST duster (not in UnlGene) 


531 


OVCA-R, UU145, PC3 


324733 AA582082 Hs.199410 ESTs 


5.17 


PRoCjcon, PRSojog, nuj+kwo 


313577 AA565051 Hs.155029 ESTs 


5.16 


OVCA-R, PC3, EB 


310966 AW271974 Hs.210295 ESTs 


5.15 


Nui-noa, PRoujog, PR5H/_con 


311332 AW292247 Hs.255052 ESTs 


5.05 


Caco2, OVCA-R, EB 


314522 AI732331 Hs.187750 ESTs; Moderately similar to IHI ALU CLA 


5.04 


cB, OU145, HT23 


330886 AA135606 Hs.189384 ESTs; Weakly similar to IHI ALU SUBFAMl 


4.93 


OVCA-R DU145 Caco2 


313597 AW162263 Hs.249990 ESTs 


4.84 


NCJ4W60,NCW345,NCI-H23 


314439 AI539443 Hs.137447 ESTs 


4.84 


DU145,Caco2,MB-MDA-231 


320807 AA086110 Hs. 188536 H sapiens done 24838 mRNA seq 


433 


PC3, OVCA-R, DU145 


311804 AA135159 Hs.203349 ESTs 


4.82 


OVCA-R, PC3, Caco2 


321354 AA078493 EST duster (not in UnlGene) 


431 


DU145, EB, OVCA-R 


325169 H01560 Hs.163816 ESTs; Weakly similar to IHI ALU SUBFAMl 


4.8 


NCI-H345.DU145, LnCap 


312828 AB65455 Hs.211618 ESTs; Moderately similar to III! ALU SUB 


4.78 


DU145,DU145,DU145 


321226 AA311443 Hs.251416 H sapiens mRNA; cDNA DKFZp58&£2317 (from 


4.75 DU145, OVCA-R, M&MDA-453 


327772 CH.05Ju>gI|5867964 


4.74 


HT29,MBM)A-231,NCI-H345 


315642 AA742222 Hs.120634 ESTs 


4.7 


DU145.EB.M&MDA-453 


311905 AA555215 Hs.151913 ESTs 


4.7 


DU145,Caco2 l PRSC_con 


312754 R99834 Hs.250383 ESTs 


4.59 


OVCA-R, PC3.EB 


336637 CH2LFGENES.13-7 


4.58 


NOH69,PRSCJco.,NCI-H345 


331644 T99544 Ha.173734 ESTs; Weakly shnflar to !H! ALU CLASS B 


4.55 


OVCA-R, NCI-H345, Caco2 


336984 CH22_FGENES.401-2 


435 


Caco2, Caco2, EB 


316261 AW134485 Hs.144967 ESTs 


433 


NCI-H460,NCI-H345,Caco2 


300417 AW139492 Hs.245887 ESTs 


432 


DU145, CALU6, EB 


300610 N72596 Hs.39120 DEAD/H (Asp-Gto-Ala-Asp/His) box potypep 4.52 


OVCA-R, PC3, EB 


324718 AI557019 Hs.116467 ESTs 


43 


LnCap, PC3, PRSC_con 


332170 F04112 Hs.177178 ESTs 


4.47 


08002,1^145, DU145 


324042 AA377589 EST duster (not In UnlGene) 


4.45 


NCW345, PRSC.con, PRSCJog 


331148 R73816 Hs.17385 ESTs 


4.44 


CALU6, OVCA-R, EB 


328981 CH.09_hsgIj5868527 


4.43 


HT29,BT474,NC1-H69 


321920 N63915 EST duster (not m UnlGene) 


4.34 


Caco2.A549.A549 


320832 AA214584 EST duster (not In UniGene) 


434 . 


NCI-H23, CALU6, OVCA-R 


321971 AI680459 Hs£01441 ESTs 


4.33 


DU145,HT29,CALU6 


308572 A 1707882 EST singleton (not in UnlGene) with exon 


433 


MCF7, NCI-H345, OVCA-R 


302459 AF169255 EST duster (not in UnlGene) with exon h 


4.28 


MB-MDA-231, OVCA-R, LnCap 


321847 T08401 EST duster (not In UniGene) 


455 


MB-MDM53, NB-MDA-435S, MB^lDA-231 


337884 CH22LEMAC005500.GENSCAN^4-2 


4.23 


HT29,NCI-H23,MB-MOA-435s 


307494 AI269188 Hs.175656 EST 


4.23 


Nd-H23,NCl-H520,NCI-358 


314915 AA573072 Hs.187748 ESTs; Weakly similar to till ALU SUBFAMl 


4.21 


PC3, OVCA-R, Caco2 


336638 CH23LFGENES.14-2 


451 


NCI-H69.NCI-H345, PRSCJog 


319379 T91443 Hs.193963 ESTs 


4.2 


PC3,OVCAR, LnCap 


312332 R33041 Hs.106200 ESTs 


4.19 


NCI-H69, OVCA-R, NCI-H460 


331445 H89093 Hs.41215 ESTs 


4.19 


EB,HT29,DU145 


315841 AW136397 Hs.247572 ESTs 


4.19 


Caco2,MB-MDA453, LnCap 


315712 AI950133 Hs.120882 ESTs; Moderately similar to (111 ALU SUB 


4.18 


LnCap, NCI-H345, OVCA-R 


319559 AA773876 Hs.251597 ESTs 


4.15 


NCW345,Caco2,DU145 


300791 AL138455 Hs.256135 ESTs; Moderately similar to ! Ill ALU SUB 


4.13 


NCW58,RPWE-2,NCWM60 
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312129 AW300867 EST duster (not in UniGene) 4.12 

321168 AA411263 Hs.128783 ESTs 4.11 

313220 AJ971981 Hs.118241 ESTs 4.1 

314022 AW452420 Hs.248678 ESTs 4.1 

321359 AW474412 EST duster (not in UniGene) 4.1 

328841 CH.07_hsgl|6381920 4.09 

337898 CH2?3ftAC005500.GENSCANiW 4.09 

333245 CH2ZJGENES.115J 4.09 

311958 AI247472 Hs.132965 ESTs 4.0B 

314775 AM49880 Hs.188809 ESTs 4.06 

317901 AW150944 Hs.250541 ESTs 4.06 
309985 AW452919 EST singleton (not In UniGene) with axon 4.05 
311004 AA632846 EST duster (not in UniGene) 4.05 
323497 AI523613 Hs.221544 ESTs 4.04 
332347 W60326 Hs.221716 ESTs 4.04 
331388 AA456852 Hs.43543 suppressor of white apricot homotog 2 4.01 
313197 AI738851 Hs.222487 ESTs 3.96 
315710 AA931550 Hs.192785 ESTs 3.95 
316897 AA838114 EST duster (not in UniGene) 3.94 
322564 W86440 Hs.118344 ESTs 3.94 
304605 AA513225 EST singleton (not in UniGene) with axon 3.9 
325726 CH.14JK #552447 3.9 
320190 R32047 Hs.141012 ESTs; Weakly sWIarto lit! ALU SUBFAMl 3.89 
331566 N63062 Hs.48703 EST 3.87 
319403 T98413 EST duster (not in UniGene) 3.86 
324643 AM36356 Hs. 130729 ESTs 3.84 
315298 AI969314 Hs^11377 ESTs 3.82 
321632 AA419617 EST duster (not in UniGene) 3.81 
313219 N74924 Hs.182099 ESTs 3.8 
330833 AA046804 ESTs; Weakly simJar to ALU SUBFAMl 3.8 
327289 CH.01_hsgiJ5867481 3.79 
314429 AW300749 EST duster (not in UniGene) 3.79 
314475 AI911160 Hs.127505 ESTs 3.79 
317130 AW293995 Hs. 192277 ESTs 3.78 
336635 CH2Z.FGEtES.13n5 3.77 
333323 CH22_FGENES.138J6 3.76 
332135 AA620331 Hs.245351 EST 3.75 
316979 AA861087 EST duster (not in UniGene) 3.75 
316435 AI671871 Hs. 1 9261 8 ESTs; Weakly similar to ALU CLASS C 3.74 
315422 AW135357 Hs.192374 ESTs 3.73 
336616 CH2ZJGENES.613.5 3.72 
320258 W93241 EST duster (not m UniGene) 3.71 
300463 N52510 Hs.186470 ESTs 3.69 
306881 A108B695 EST singleton (not in UniGene) with exon 3.68 
337304 CH2^FGEtES.681-6 3.67 
323693 AW297758 Hs.249721 ESTs 3.67 
331073 R07998 Hs.18628 ESTs; WeaJdy simflarto HI! ALU SUBFAMl 3.67 
318162 AW296277 Hs.132171 ESTs 3.87 
318042 AW294522 Hs.149991 ESTs 3.66 
308069 AI470895 EST singteton (not in UniGene) with exon 3.64 
327614 CH.04_hsg({6525283 3.62 
337514 CH2Z.FGENES.809-7 3.62 
332093 AA608794 Hs.112592 ESTs 3.6 
327793 CH.05 - .hsgI|5e67979 3.59 
331053 N7Q242 Hs.183146 ESTs 3.59 
303769 AA134888 Hs.173415 ESTs 3.58 
319872 R97130 Hs.189699 ESTs 358 

317902 A1828602 Hs.211265 ESTs 3.57 
324090 AI656531 Hs.116070 ESTs 3.57 
300120 AW204314 Hs.170784 ESTs 357 
307752 AI339447 EST singleton (not in UniGene) with exon 356 
322438 W44531 Hs.167851 ESTs 355 
311275 AI659166 Hs.207144 ESTs 3.55 
338830 CH22.PJ246D7.GENSCAN.6-7 354 
315647 AA648983 Hs.212911 ESTs 353 
331469 N22273 Hs.39140 ESTs 352 
313445 AI123657 Hs.127264 ESTs 3.51 
330139 CH.21j>2gl|4210430 35 
304450 AA404521 Hs.10326 coatomer protein complex; subunit epsilo 3.49 
325763 CH.14_hsgi]6682475 3.49 
312803 AA677934 Hs.117864 ESTs 3.47 
303654 AA436942 Hs.166308 ESTs 3.46 
317924 AI222324 Hs.166306 ESTs; Weakly simitar to zinc finger prot 3.46 
312354 AA036955 Hs.167040 ESTs 3.44 
337517 CH22.FGENES.814-6 3.43 
324865 AA702138 Hs.114103 ESTs 3.42 



OVCA-R, MCF7.A549 

OVCA-R, 08002, PRSCjan 

OVCA-R, DU145,Caco2 

OVCA-R, EB.PC3 

DU145. OVCA-R. PC3 

NCW69,PRSCJog,NCW345 

NCl-H345,NCi-H69, OVCA-R 

PRSCJog,PRSC.con l NCW345 

EB,DU145,CALU6 

OVCA-R, PC3.EB 

BT474, MB-MDA453, M&-MDA-435S 

MB-MDA-453, NCI-H23. NCW520 

MB-MDA-453, OVCA-R, EB 

LnCap.OVCA-R.EB 

EB.CALU6.PC3 

A549.EB.Caco2 

OVCA-R, EB, PC3 

EB, MB-MDA-231, OVCA-R 

OVCA-R, A549, MB-MDA-453 

NCi4i460.Caco2.EB 

NCW345, RPWE-2, BT474 

OVCA-R, LnCap, LnCap 

DU145.NCW23,PRSCJog 

NCI-H23. NCI-H460, NCI-358 

NOH345,PRSCJog, LnCap 

OVCA^.DU145,NCW345 

NCI-H345,PRSCjxm,PRSCJog 

EB, OVCA-R, A549 

EB, Caco2, OVCA-R 

LnCap, DU145.PC3 

EB.HT29.DU145 

OVCA-R, PC3, PRSCjcon 

DU145.CALU6.NCI-H69 

EB ( PC3.Caco2 

NCW69,NCI-H345,PRSCJog 

NCI-H460.NCW23, PRSCjcon 

NCI-H345.A549,Caco2 

NO-H345,NCI-H69, RPWE-2 

MB-MDA-435S, MCF7, MB-MDA-453 

OVCA-R, A549.EB 

NCI-H69.NCI-H345.RPWE-2 

MB-MDA-231, NCI-H69, EB 

OVCA*,A549, DU145 

CALU6, HT29, EB 

MCF7.M&-MDA453, LnCap 

OVCA-R, MBM)A-453, DU145 

RPWE-2, NCI-H345, OVCA-R 

MB-MDA-231, DU145.CALU6 

EB,HT29,CAUJ6 

Caco2,Caco2,NCi-H23 

NC!-H460,NCW345,NCI-H69 

NCW58,NCWi23,NCW460 

EB, OVCA-R, DU145 

LnCap, OVCA-R, EB 

OVCA-R, EB, Caco2 

HT29, CALU6, CALU6 

PRSC.con, LnCap, RPWE-2 

CALU6, NCI-H345, OVCA-R 

PRSC_oon, NCI-H345, PRSCJog 

NCI-H69,NO-H345,PRSC_con 

NCI-358.HT29, MB-MDA-231 

NCi-H345, NCI-H69, Caco2 

MWWDA-231, PRSC.con, LnCap 

LnCap, PC3, OVCA-R 

OVCA-R, MB-MDA-453, CALU6 

EB.A549, CALU6 

EB, OVCA-R, A549 

EB, CALU6, DU145 

NC1-H345 I NCI-H69,NCM4460 

PC3,BT474, OVCA-R 

OVCA-R, Caco2, RS-MDM53 

DU145.NCI-H460.NOH69 

PRSCjxwi, PRSC_log, NC1-H69 

Caco2, M&-MDA-435S, NCW460 

NCI-H69.HT29.PC3 

NCI-H23,NCl-H460.NCkH520 
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323755 AW300094 EST cluster (not In UnlGene) 3.42 

314452 AL042699 Hs309222 ESTs 3.42 

337911 CH22_EWkAC005500.GENSCAN^ 3.42 

318086 AI025499 Hs.132238 ESTs 3.41 
311859 AA704705 Hs.181044 ESTs; Weakly similar to Chain A; Human 0 

Complexed With L-Canaliie [Haptens] 3.41 

314409 H15560 Hs.131833 ESTs 3.41 

323333 AA228883 EST cluster (not In UnlGene) 3.41 

325890 CH.14Ji9g)|5867021 3.4 

314539 AA398216 Hs.190092 ESTs 3.4 

310567 AI691065 Hs.155780 ESTs 3.4 
330527 S77356 transcript ch21=oligomydn senstMy c 

8 stomach cancer ceD lines, mRNA, 262 n 33 

314660 AA436007 Hs.188760 ESTs 33 

321321 AB033072 EST cluster (not In UnlGene) 33 

323356 AA234009 Hs.188715 ESTs 33 

328592 CH.07_hsgl|5868227 33 

311116 A1631195 Hs332193 ESTs 33 

323853 AA393460 EST cluster (not In UnlGene) 33 

327740 CH.05.hsgl|5867943 33 

326857 CR20_hsgiI6552460 33 

317787 AW339612 Hs,249364 ESTs 3.31 

325760 CH.14Jisgi|6552449 3.3 

337513 CH22LFGENES.809-4 339 

336606 CH2^FGENES.429_3 33 

322695 AW470295 Hs.192152 ESTs 3.29 

314312 AA814971 Hs.257634 ESTs 33 

328224 CH.G6_hsgi}5868101 33 

336128 CH22_FGENES.701J6 337 

332442 AA281323 Hs.4947 ESTs 337 

302514 M14269 EST duster (not In UnlGene) with exon h 3.27 

313749 AW450376 Hs.130803 ESTs 33 

302891 AI681578 Hs.1 14164 ESTs 33 

334690 CH2?_FGENES420_3 33 

308676 AI761036 EST singleton (not in UnlGene) with exon 33 

304254 AA046273 Hs.111334 ferritin; Bght polypeptide 334 

311994 AA648314 Hs.13849 ESTs 334 

321020 AB023170 Hs327850 K1AA0953 protein 334 

316724 AA810788 Hs.123337 ESTs , 33 

326942 CH31JfcgPQ4446 332 

324824 AI826999 Hs324624 ESTs 331 

320789 R78712 EST duster (not in UnlGene) 331 

315070 AW131388 Hs.186735 ESTs 331 

303794 AW241987 Hs.197025 ESTs 3.19 

310237 AI884313 Hs.158906 ESTs 3.19 

313960 AA130859 EST duster (not in UnlGene) 3.18 

336634 CH22_FGENES.134 3.18 

301085 AA779058 Hs.1 90428 ESTs; Weakly similar to NG26 (H^apfensJ 3.17 

313774 AW136836 Hs.144583 ESTs 3.17 

307177 AI188864 EST singleton (not in UnlGene) with exon 3.17 

324025 AJ174661 Hs.190623 ESTs 3.17 

313099 AI307359 Hs.128064 ESTs 3.17 

305536 AA77Q682 EST singleton (not in UnlGene) with exon 3.17 

331916 AA446131 Hs.124918 ESTs 3.17 

314912 A1431345 Hs.161784 ESTs 3.17 

303388 AL039604 EST duster (not In UniGene) with exon h 3.17 

332273 R05818 Hs.173B30 ESTs 3.16 

314697 AW088739 Hs343770 ESTs 3.16 

335344 CH22_FGENES.536J 3.15 

326162 CH.17Jisgij5867168 3.15 

304467 AA424703 EST singleton (not In UniGene) with exon 3.15 

339340 CH223A354I12.GENSCAN37-8 3.15 

325393 CH.1^hsgij5866921 3.13 

315367 AA732484 Hs.169399 ESTs ' 3.13 

307085 AI160868 EST singleton (not in UniGene) with exon 3.12 

313001 N29264 Hs349591 ESTs; Moderately similar to Ilil ALU SUB 3.12 

307606 AI290006 EST singleton (not in UniGene) with exon 3.12 

325710 CH.14_hs gi)6682473 3.09 

313810 AA400079 Hs357854 ESTs 3.09 

335482 CH22J=GENES.570_11 3.09 

326310 CH.17Jisgi|5867277 3.08 

325742 CH.14_hs gi|655244B 3.08 

312467 AI241809 Hs.75458 itbosomal protein L18 3.08 

327309 CH.01Jisgl|6456757 3.07 

310583 AW205632 Hs.211198 ESTs 3.07 

322373 W25673 Hs.130829 ESTs 3.07 



PRSC.con, RPWE-2, NCW345 
NCI-H345, PRSC.oon, PRSCJog 
OVCA^,PC3 l HT29 
CALU6,UCap,OVCA-R 

LnCap ( MB^DA-435s,A549 
NCWTO, LnCap, LnCap 
Caco2,OVCA^NCI-H69 
HT29, CALU6, DU145 
MB-MDA-231.BT474.EB 
PRSC.con, NCW345, NCI-H69 

NCWfi3.Caco2.A549 

OVCA-R,BT474,Caco2 

NCW58,EB,Caco2 

DU145, CALU6, CALU6 

MCF7,NCW58,MW©A-231 

NCl-H520,NCUt23. PRSCJog 

DU145,EB,Caco2 

EB, LnCap, OVGA-R 

NCLH69, MCF7, NCI-H345 

NCI-H345, PRSCjoon. PRSCJog 

EB, CALU6, HT29 

LnCap, NCI-H23 ( NCi-H460 

NCI-K69, A549, NCJ-H23 

DU145.Caco2.EB 

RPWE-2, NO-H69.NCI-H345 

DU145,NCI-H345, LnCap 

BT474,NCW520,MB4ADA.231 

Caco2,PC3,NCi-H345 

DU145,CALUB,NC1-H520 

OVCA-R,NCI-H69,DU145 

LnCap, NCKH34S* PRSCJog 

NCI-H69, RPWE-2, PRSCjan 

DU145,MB-MDA-231,HT29 

OVCA-R.DU145.A549 

NCI-H460, NCI-H23,MB^DA-453 

EB,MCF7,MB^DA-435s 

DU145,OVCA-R,BT474 

HT29.BT474.NCI-H23 

OVCA-R,MBM)A^53 f EB 

DU145, LnCap, EB 

Caco2, NCL358,NCi-H460 

OVCA-RPC3, LnCap 

NCI-358.NCm345.MCF7 

MBJ\/IDA-231,HT29,BT474 

NCW69,NCI-H345,BT474 

NCI-H345, NCW345, NCI-358 

Caco2 t EB J OVCA-R 

EB, CALU6, CALU6 

OVCA-R.DU145.PC3 

MB-MDA-231,BT474,EB 

NCW58 t Caco2,HT29 

EB,OVCA*,Caco2 

EB, BT474, MCF7 

HT29 1 NCW58,Caco2 

MCF7, DU145, EB 

MB^DA-453,DU145,MCF7 

PRSCJog, NCW345, PRSC.con 

BT474,HT29,HT29 

NCI-H23, RPWE-2, NCI-H460 

LnCap. OVCA-R MB-MDA-453 

Caco2,NCI-H23, NCI-358 

OVCA-R.EB, MB-MDA-453 

RPWE-2, PRSC.con, PRSCJog 

NCI-H345,OVCA^Caco2 

MB-MDA-231,Kn29 f NCW23 

NCW69,MBWU53 1 BT474 

EB, DU145, CALU6 

NCI-H460,NCW58,NCW23 

MCF7, MB-MDA453, PC3 

NCI-H23,Nd4W60,HT29 

NCW58,NCW23,NCkH460 

NCI-H69, M&MDA-435S, MB-MDA-435S 

OVCA-R,A549,Caco2 

NCI-H69, PRSCjan, NCI-H345 
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324497 AW152624 Hs.136340 ESTs 3.06 

315095 AAB31815 H&243788 ESTs 3.06 

302445 N79647 EST cluster (not In UniGene) with exon h 3.05 

302842 AW383226 Hs.163834 ESTs; Highly similar to Chp [R.rorvegteu 3.05 

317346 AA952875 Hs.221274 ESTs 3.04 

334650 CH22.FGENES.417J7 3.04 

306644 A1002913 EST singleton (not in UniGene) with exon 3.04 

322682 AJ110679 EST cluster (not In UniGene) 3.03 

311065 AW204582 Hs.224906 ESTs 3.03 

318623 AA355439 Hs.151547 ESTs 3.03 

304978 AA617735 EST singleton (not in UniGene) with exon 3.03 

305554 AA774567 Hs.121774 EST 3.03 

302574 U66199 Hs.249165 fibroblast growth factor 11 3.03 

336202 CH22_FGENES.719_6 3.02 

302893 AL1 17539 Hs.173515 H sapiens mRNA; cDNA DKFZp586H021 (from 

315166 A1343966 Hs.158528 ESTs 3.01 

335606 CH22JGENES.58£_3 3.01 

330058 CH.17_p2gi|6634847 3.01 

303179 AA071215 EST duster (not in UniGene) with exon h 3.01 

307625 A1299617 EST singleton (not to UniGene) with exon 3 

323074 AL119445 Hs.203213 ESTs 3 

336232 CH22J=GENES.736_7 3 

334915 CH22JGENES.457_4 3 

329116 CHJUis QiI5B68550 3 

333495 CH22.FGENES.16BJ 3 

303756 AI738488 Hs.115838 ESTs 2.99 

332134 AA610123 Hs.139240 DKFZP564F1422 protein 2.99 

322916 AW367294 Hs.154091 ESTs 2.99 

318050 AI052093 Hs.133132 ESTs 2.99 

301019 AI147356 HsS8722 ESTs 2.99 

315213 AA587773 Hs.136494 ESTs 2.98 

339251 CH22_BA354l1lGENSCAN.7-5 2.98 

303835 T05645 EST duster (not in UniGene) with exon h 2.97 

300070 A1174603 H&256832 ESTs Z97 

320954 AB028953 H&204121 KIAA1030 protein 2.97 

327624 CH.04JisgIj5867871 2.97 

329029 CHJLhsgi|6525302 2.96 

317040 AA866584 Hs.126154 ESTs 2.96 

328016 CH.06_hsgi}5902482 2.96 

312674 AI762475 Hs.151327 ESTs; Moderately similar to Ifll ALU SUB 2.96 

332301 R70253 Hs.127826 ESTs 2.96 

300951 AI732374 Hs.1Q5B34 ESTs; Weakly simflar to 25 kDa trypsin i 2.95 

318226 AI078446 Hs.134125 ESTs 2.95 

311349 AW292933 Hs.254110 ESTs 2.94 

312757 AI285970 Hs.183817 ESTs 2.94 

316507 AI381515 Hs.158381 ESTs 2.94 

302278 AF018080 Hs. 173730 Mediterranean fever 2.93 

311016 AW173166 Hs.243468 ESTs Z93 

323864 AA340724 Hs.214028 ESTs Z92 

336632 CH22_FGENES.13-2 2.92 

328886 CH.07J1S gi|6588003 2.92 

301859 T61587 EST duster (not in UniGene) with exon h 2.92 

323775 AA329856 Hs.143022 ESTs 252 

315426 AI391486 Hs.128171 ESTs 2.92 

322264 AF086242 EST duster (not in UniGene) 2.92 

315135 AA627561 Hs.192446 ESTs 2.91 

327982 CH.06_hsgil5866216 2.91 

314530 AI052358 Hs.131741 ESTs 2.91 

315003 AA527650 Hs.156037 ESTs 2.9 

339032 CH2^DA59H18.GENSCAN.25-1 2.9 

308379 AI623950 Hs2186 eukaryotic translation elongation factor 2.89 

312133 T87714 Hs.221665 ESTs 2.88 

307992 AI434166 EST singleton (not in UniGene) with exon 2.88 

308010 AI439190 Hs.181165 eukaryotic transtation elongation factor 2.88 

320154 AA336019 Hs.119559 ESTs 2£8 

331496 N34929 Hs.171984 ESTs Z86 

320016 H57622 Hs.194574 ESTs 2.86 

317923 AW450544 Hs.220751 ESTs 2.86 

301822 X17033 Hs.1142 integral; alpha 2 (CD49B; alpha 2 subuni 2.86 

311759 AA705075 Hs.169536 Rhesus blood group-associated glycoprote 2.85 

315083 AI221325 HsJ210655 ESTs 2.84 

317759 A1908455 Hs.202460 ESTs; Weakly similar to hypothetical L1 2.83 

313980 AI633205 Hs.159914 ESTs 2.83 

310941 AI453402 Hs.173705 ESTs; Weakly similar to llll ALU CLASS C 2.83 

313593 AJ911488 Hs213724 ESTs 2.83 

314973 AW273128 Hs.254669 EST 2.82 



NCI-H345, RPWE-2, PRSCjan 
Caco2,DU145,EB 
OVCA-R, A549.NCI-H460 
A549.DU145.NCI-H23 
BT474, HT29.HT29 
MCF7.BT474, OVCA-R 
CALU6,MCF7,BT474 
NCI-H345, RPWE-2, OVCA-R 
PRSCJog, PRSC.oon, NCI-H460 
DU145, M&MDA-435S, KT29 
CALU6,BT474,MB-M>A435s 
EB,NCI-H460,Caco2 
HT29.DU145,PC3 
NCI-H69,NCI+t23.NCi-H23 
3.02 EB,0U145,CALU6 • 
Caco2, EB.NCI-H69 
NCI-H23,NCkH520, NCI-H345 
OVCA-R HT29. LnCap 
MCF7, RPWE-2, MB-MDA453 
MB4l/©A-231,LriCap l BT474 
NCi-H23,NCI-H520,NCI-H460 
HT29,BT474,MB4WDA-231 
NCI-H345, PRSCjsn, NCI-H69 
NCm69,PRSC_coa RPWE-2 
OVCA-R, NCW69.NCI-H345 
HT29,PRSC_con,DU145 
EB.A549.MCF7 
DU145.DU145, OVCA-R 
NO-H345,DU145,NCI-H520 
NCI-358, NCI-H69, MB-MDA435S 
MB^VIDA-231,BT474 f LnCap 
NCW69.PRSC_tog.HT29 
BT474, NCI-H345, LnCap 
DU145.A549, OVCA-R 
LnCap, DU145.PC3 
EB,DU145, LnCap 
NCI-H69, PRSCJog, LnCap 
DU145,EB, LnCap 
NCI-H345,PRSC_con, DU145 
OVCA-R,NCI-H69 t NO-H69 
OVCA-R, DU145, MB-MDA-231 
NCI-358, NC^H460,Caro2 
NO-H460,NCI-H23,Na358 
EB.DU145, OVCA-R 
DU145,UCap, LnCap 
PRSCjxm, PRSCJog, RPWE-2 
EB,NCi-H69,DU145 
MCi-H345.LnCap.UCap 
EB, Caco2,HT29 
NCI-H69. NO-H345, MB-MDA-231 
HT29,PC3,LnCap 
LnCap, EB.EB 

PRSC.oon, PRSCJog, RPWE-2 
CALU6, EB, A549 
Caco2, OVCA-R, DU145 
EB.HT29.DU145 
UCap,MB«A453,NCi-H69 
NCI-H460.NCW520, RPWE-2 
PRSC.con, RPWE-2, MB-MDA-231 
NCI-H69, PRSCjcon, RPWE-2 
BT474, MB-MDA-231. HT29 
Caco2,MB-MDM53, MCF7 
NCI-H520, MCF7, NCW23 
Caco2,NCI-H69,NCI-H345 
MB4VIDA-453,DU145,EB 
MBM5A-453.PC3, MCF7 
PRSCcon, RPWE-2, PRSCJog 
NCW345. PRSCjcon, PRSCJog 
PC3,BT474 f CALU6 
DU145.HT29, MB-MDA-231 
PRSC.con, RPWE-2, NCW345 
HT29,M&M5A-231,BT474 
Caco2.MB-MDA-453.A549 
NCI-H345,MCF7,Caco2 
LnCap, Caco2,NCI-H460 
BT474, LnCap, RPWE-2 
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310950 AI582758 H3.170561 ESTs 182 

323826 AL039822 Hs.207604 ESTs 2.82 

325410 CH.12_hsgI|5866921 2.81 

313911 AI565458 Hs.116385 ESTs 2.81 

334244 CH22J=GENES.365.5 181 

309333 AWQ25709 EST singleton (not in UnlGene) with exon 181 

328467 CH.07_hsgi|5B68434 2.81 

318553 AW250501 EST duster (not in UnlGene) 181 

326412 CH.19_hsgi|5667362 • 2.81 

303407 AA309616 EST duster (not in UnlGene) wtfii exon h 2.8 

328462 CH.07Jisgi|5868433 2.8 

335157 CH22J=GENES.501J 2.8 

313458 AA007259 Hs.255853 ESTs 2.79 

310416 AI695047 Hs.202395 ESTs 179 

317709 AI435973 Hs.128056 ESTs 179 

321415 A1377596 Hs.3337 transmembrane 4 superfamfly member 1 179 

313693 AW469180 Hs.170651 ESTs 2.79 

309438 AW102802 Hs.225787 ESTs; Moderately similar to hypothetical 2.79 

308961 A1S7Q248 EST singleton (not in UnlGene) with exon 2.78 

329107 CHJLhsgI|5868626 178 

313975 AW025024 Hs.651 14 keratin 18 178 

330901 AA157818 Hs.238380 Human endogenous retroviral protease mRN 178 

311749 R06249 Hs.13911 ESTs 2.78 

329853 CH.14_p2glI6682295 178 

322340 AF088076 EST duster {not in UniGene) 2.77 

326806 CHi0.hsgiJ6469835 2.77 

314661 AA436432 EST cluster (not in UniGene) 177 

322135 AF075Q82 EST duster (not in UniGene) 177 

331849 AA417078 Hs.193767 ESTs 177 

301056 AI797955 HS.208Q76 ESTs; Weakly similar to D(4) DOPAMINE RE 2.76 

327739 CH.05.hsgi|5867942 2.76 

306016 AI445116 EST singleton (not in UniGene) with exon 2.76 

331549 N56866 Hs237507 EST 176 

331851 AA418599 Hs.98303 caveofin3 175 

315023 AA533505 Hs.185644 ESTs 175 

335565 CH22.fGENES.579J 175 

306137 AA916176 EST singleton (not in UniGene) with exon 2.74 

332240 N54803 yv31d2.s1 Scares fetal iiver spleen 1NFL 

3" similar to contains L1.t3 L1 repetit 2.74 

313246 N90762 Hs.159454 ESTs 2.74 

303642 AW299459 EST duster (not in UniGene) with exon h 174 

325513 CH.12_hsgpi7035 174 

337236 CH22LFGENES.639-2 174 

311555 AW407892 H*244807 ESTs 2.74 

339266 CH22_BA354I12.GENSCAN.104 173 

300127 AW028615 Hs.235224 ESTs;WBaWys^totoKlAA0422[H«api 173 

311741 R00099 Hs.193642 ESTs 172 

310915 AW449673 Hs.201893 ESTs 172 

324982 T31689 Hs.98518 ESTs 171 

305030 AA629988 EST singleton (not in UniGene) with exon 171 

315396 AW296107 Hs.152686 ESTs 169 

319098 AI908374 EST duster (not in UniGene) 2.69 

309119 AI927384 Hs.228499 EST; Moderately similar to PK-120 precur 2.69 

312095 AW444937 Hs.233482 ESTs 168 

324316 AI291330 EST duster (not in UniGene) 168 

331367 AA425668 Hs.41641 ESTs; Weakly simBar to CAGH4 [H^aptens 168 

339116 CH22_DA59H18.GENSCAN.494 168 

324297 AI565566 Hs.168587 ESTs 168 

31872B Z30201 EST duster (not In UnlGene) 168 

304813 AA584540 EST singleton (not in UniGene) with exon 168 

312393 N34376 Hs.191 659 ESTs; WeaMysirrtJar to HI! ALU CLASS E 2.68 

330671 AB002302 Hs.92236 KIAA0304 gene product 2.67 

305406 AA723860 EST singleton (not in UniGene) with exon 2.66 

330957 H08778 Hs.133521 ESTs 166 

300350 AI871129 Hs.172597 ESTs; Weakly simBar to zinc finger prot 166 

322302 W76021 EST duster (not In UniGene) 166 

321891 AW157424 Hs.165954 ESTs 166 

300124 AI217394 Hs.242447 ESTs 165 

302747 AF062275 EST cluster (not in UniGene) with exon h 2.65 

306741 AI802780 Hs209002 ESTs; WeaWy similar to III! ALU SUBFAMl 165 

310802 AI631546 Hs.159732 ESTs 165 

300694 AA063406 EST duster (not in UnlGene) with exon h 165 

311395 R23313 EST duster (not in UnlGene) 164 

336538 CH22_FGENES.840_2 164 

316473 AAB29961 EST duster (not in UnlGene) 164 

328134 CH.06_hsgP68039 164 



EB, MB-MDA453, LnCap 
PC3 t HT29,CAUU6 
MB-MDA453, PRSC.con, NCW58 
PRSC_con, EB t RPWE-2 
OVCA-R, PC3.MB-MDA453 
NCI-H460, NCI-H23. NCI-358 
EB, OVCA-R, HT29 
BT474.NCW23, MB-MDA-231 
BT474, PRSCJog, RPWE-2 
CALU6,NCW345,DU145 
BT474,CALU6,MCF7 
NCI-H69, NCI-H345, PRSCJog 
OVCA-ft, DU145, LnCap 
DU145, MB-MQA-435s, PC3 
NCI-H460,NCI-358,DU145 
A549.PC3, OVCA-R 
OVCA-R, MCF7, EB 
PC3, OVCA-R, DU145 
BT474, MB-MDA-231, EB 
DU145,MCF7,M&MDA435s 
Caco2,EB,DU145 
PC3,NCRi520.BT474 
OVCA-R, MBMDA453, MCF7 
BT474,BT474,HT29 
NCW345,Caco2, LnCap 
NCI-H69, NCI-H345, MB-MDA-231 
NCI-H460, NB-MDA435S, CALU6 
NCI-358,NCW460,Caoo2 
DU145, EB, CALU6 
NCI-H69, RPWE-2, PRSCjxm 
EB.PC3, LnCap 
LnCap, HT29, MB-MDA-231 
MB-MDA-453, MCP7, OVCA-R 
MB«A-231,NCI-H345,BT474 
PRSC.con, OVCA-R, EB 
OVCA-R, EB.A549 
EB. LnCap, DU145 

DU145, EB, CALU6 
NCW69.NCW345, PRSC.bg 
EB.A549,Caco2 
MB-MDA431, NCI-H345, BT474 
MCF7,MB-MDA453,NCl-H69 
BT474,NCW345,NCI-H69 
CALU6.DU145, OVCA-R 
NCI-H345, RPWE-2, PRSCJog 
LnCap, PC3, OVCA-R 
DU145,EB,MB-M)A435s 
PRSC.con, PRSC_tog, RPWE-2 
DU145.DU145.NCW58 
OVCA-R, Caco2, EB 
RPWE-2. LnCap, PC3 
LnCap, NCW23.NCW58 
Caco2, OVCA-R, HT29 
NCI-H460, Caco2, PRSCJog 
I^MDA435s, NCI-H520, NCI-H460 
DU145,EB,CALU6 
PRSC.con, OVCA-R, PRSCJog 
LnCap, Caco2, PC3 
BT474, OVCA-R, RPWE-2 
NCl-H345,PRSC_con,EB 
NCI-358, OVCA-R, Caco2 
OVCA-R, EB.MCF7 
EB.PC3, OVCA-R 
NCI-H23,NCW520,NCI-H460 
DU145, OVCA-R, PC3 
EB, OVCA-R, Caco2 
PRSC.con, A549, HT29 
NCl-H23,BT474,MCF7 
PC3.EB, OVCA-R 
PRSC.con, PRSCJog, NCMH69 
BT474, EB.MCF7 
EB, OVCA-R, DU145 
DU145,NCt-H460, NCW58 
LnCap. OVCA-R, EB 
LnCap, EB.CALU6 
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329330 




/-VII v kn ••HCOCOOfVO 

CH JLns glJSBoooOo 


2.64 


316664 AS042101 




EST duster (not in UniGene) 




328015 




CH.0D„nS gip9U24o2 


2.63 


308991 AI879831 




EST singleton (not In UniGene) with exon 


2.63 


323899 AL042966 




^fl'l -1. ,„1,.- /_ -.1 »_ | l-.l^__._A 

Est ouster (not In UnlGene) 


262 


321708 AA476817 




EST cluster (not In UnlGene) 


2.62 


301752 T75247 




EST duster (not in UnlGene) with exon h 


2.62 


309351 AW057547 


WIMP i . j. t^.^L !_ 1 »1L _ 

EST singleton (not in UniGene) with exon 


262 


314412 AI864270 


Hs.155654 


ESTs 


2.62 


309441 AW103055 HsJ244230 


EST 


2.62 


335993 




CH22_FGENES.656_6 


2.61 


318196 AKJ56776 


Hs.133397 


ESTs 


2.6 


322880 AA310521 


Hs.50848 


m'f * |lf u lJ., — t fl a I/If iJlAAA ■ _ * - 

ESTs; Weakly simitar to WAA0B62 protein 


2.6 


300558 AI540051 


Hs.122638 


ESTs 


2.6 


318594 AA918320 Hs.224581 


ESTs 


2.6 


308554 AI698132 


Hs.201923 


EST 


2.6 


335108 




CH22LR3ENES.494J4 


2.6 


312483 AI417526 


Hs.184636 


ESTs 


2.59 


311981 AW452773 Hs^57612 


EST 


259 


319359 F13458 




EST duster (not In UnlGene) 


2.59 


300230 A1377746 


K3.158846 


ESTs 


2.59 


316504 AW135854 Hs.132458 


ESTs 


259 


322337 AA249804 




EST duster (not in UniGene) 


2.59 


301775 AW247670 


EST duster (not in UniGene) with exon h 


259 


301089 AA666396 


Hs.220727 


ESTs 


258 


331213 T88698 


Hs.163862 


ESTs 


258 


321121 W23285 




EST duster (not in UniGene) 


258 


316634 AW241910 Hs.122254 


ESTs 


258 


322141 AF075092 




EST duster (not In UnlGene) 


2.58 


312108 T82331 


Hs.127453 


ESTs 


2.58 


339071 




CH22_DA59H18.GENSCAN.34-1 


258 


311666 AW389509 Hs^23747 


ESTs 


257 


318662 AI285898 


Hs.115367 


ESTs 


257 


317010 AA863395 




EST duster (not in UniGene) 


257 


324710 AI742028 


Hs.120884 


ESTs; Weakly similar to RAS-RELATED PROT 


327888 




CH.06Jisgi|5868149 


256 


336149 




CH22LFGENES.706.5 


256 


312816 H74319 


Hs.188620 


ESTs 


256 






CH06Jisgil58B7994 


256 


316761 AI911173 


Hs.213722 


ESTs 


255 


336958 




CH22LFGENES.367-1 


255 


325043 W27919 


Hs.32944 


inositol polyphosphates-phosphatase; ty 


255 


315417 AW452360 Hs.186770 


ESTs 


255 


331603 N78656 


Hs.161535 


EST 


255 


309403 AW082954 




EST singleton (not in UniGene) with exon 


255 


337289 




CH22_FGENES.672-8 


254 


314242 A1570943 


H&246280 


ESTs 


254 


328053 




CH.06_hsgi{5902482 


254 


307215 A1193189 




EST singleton (not In UniGene) with exon 


253 


327566 




CH.Q3Jisgi|5867811 


253 


326338 




CH.17_hsgi|6056311 


253 


318115 AI384027 


Hs.159130 


ESTs; Moderately similar to IIII ALU SUB 


253 


307437 AI245683 




EST singleton (not in UniGene) with exon 


252 


322059 AA412371 


Hs.121344 


ESTs 


2.52 


322505 AF147315 




EST duster (not in UniGene) 


2.52 


314032 AW081897 Hs.193211 


ESTs 


2.52 


336125 




CH22.FGENES.701J2 


251 


312765 AI692908 


Hs.181873 


ESTs 


251 


335523 




CH22_FGENES.572_3 


251 


327585 




CH.03J*gi|5867825 


.251 


323183 AW393850 




EST duster (not in UnlGene) 


251 


314418 AI478722 


Hs.232275 


ESTs; Moderately similar to 111! ALU SUB 


2.51 


313361 A1359782 


Hs.137312 


ESTs 


25 


305632 AA805276 




EST singleton (not in UnlGene) with exon 


2.5 


331689 W90131 


Hs.184675 


ESTs 


25 


323438 AI540243 


Hs.113817 


ESTs 


25 


315742 AI821724 


Hs.143198 


H sapiens PAC done DJ0872F07 from 7q31 15 


305971 AA886874 




EST singleton (not in UniGene) with exon 


25 


336633 




CH22_FGENES.1W 


25 


304746 AA577793 




EST singleton (not In UnlGene) with exon 


249 


327925 




CH.06Jtsgi|5868172 


249 


336055 




CH22J r GENES.683_4 


249 


326888 




CH.07JisgiI6588003 


248 


311244 AW016694 Hs.197689 


ESTs 


248 


327155 




CH.01_hsgiJ5867549 


248 


334907 




CH22_FGENES.453_2 


248 



EB, CALU6, DU145 

NCI-H345, MB-MDA-231, PRSCJog 

BT474.HT29, MB-MDA-231 

BT474,EB,NCI-H23 

DU145,A549,CALU6 

EB,A549,CALU6 

HT29,BT474,NCI-H345 

NCI-H23, PRSCjan, LnCap 

CALU6, MB-MDA-231, BT474 

BT474, MB-MDA-231, MB-MDA-453 

NCI-H460. NCI-358. NCI-H520 

EB, CALU6, HT29 

DU145.A549.PC3 

OVCA-R, NCI-H69.MCF7 

PC3.MB4WDA-453.DU145 

LnCap, EB.NCI-H345 

NCI-H69, NCI-H345, MB-MDA-231 

PC3.DU145, OVCA-R 

NCW460,MB^DA453,NCI-H23 

LnCap,NCl-H460, MB-MDA-231 

HT29, NCk358, NCW345 

DU145,EB,CALU6 

NCI-H69.NCI+l345 l NCI4tt45 

NCI-H345, RPWE-2, PRSCJog 

PRSCJog, PRSC_con, RPWE-2 

DU145, EB, OVCA-R 

NCWmMB-MDA^s.PCS 

MCF7.HT29.BT474 

PC3, OVCA-R, HT29 

A549,CALU6,Caco2 

CALU6,DU145,EB 

OVCA-R, MBM3A-231, BT474 

OVCA-K,DU145,EB 

NCW520, PRSC_con f NCW58 

257 LnCap, DU145, MB4/IDA-453 

NCI-H345, MB-MDA-435S, RPWE-2 

NCI-H69,PC3,A549 

EB,Caco2,NCWM60 

NCI-358, NOH520,NCI-H23 

NCI-H345, NCI-H460, MB4WDA-231 

HT29, CALU5, CALU6 

NCW460,NCI4123 I HT29 

NCI-H345,NCI-H69,PRSCj>on 

NCI-H345, PRSC.con, PRSCJog 

BT474 ( M&MDA-231 I MCF7 

BT474,HT29,MWWDA^31 

Caoo2, MB-MDA-435S, MB-MDA453 

MB^WDA-231, DU145, M&-MDA-453 

HT29,CALU6,MBJUDA-231 

NCf-H69,NCI-H520,NCI-H345 

PC3.A549.DU145 

DU145,EB,PC3 

NCl-H23,NCI-H520, NCI-358 

EB.DU145, OVCA-R 

PRSCjcon, RPWE-2, NCI-H69 

NCI-H345, LnCap, DU145 

NCI-H69, LnCap, DU145 

NCI-H23, NCI-358, NCI-H520 

HT29 f BT474,OVCA^ 

HT29,NCI-H460,M&M)A-453 

MB-MDA-231, LnCap, RPWE-2 

EB,DU145,DU145 

CALU6,HT29,DU145 

MB-MDA-453, NCI-H460, NCKH23 

NCI-H69,EB,A549 

NCI-H345. PRSC.con, MB4WDA-231 

MCF7, IV&MDA-453, IVBWDA-435S 

NCI-358, NOH23, NCI-H520 

NCI-H69, NCI-H345, PRSCJog 

NCI-H69,BT474. MB-MDA-231 

NCW58, NCI-358, NCI-H460 

EB.HT29.M&MDA-231 

MB-MDA-435S, MB-MDA^53, PRSC log 

NCW-I345,MCF7,PC3 

NCI-H69, MB^DA-231. NCI-H345 

DU145.NCI-H345, MB-MDA-231 
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314887 AA910236 Hs.139469 ESTs 2,48 

339435 CH2LDJ579N16.GENSCAN.18-10 2.48 

334172 CH22_FGENES.349_5 2.48 

320767 AA299525 EST duster (not in UniGene) 2.48 

338772 CH2LFGENES.156-1 2.47 

326957 CH.21Jisgi|6469836 2.47 
308505 A1686615 Hs.200778 EST; WeaWy similar to SALIVARY PROLINE- 2.47 

321325 AB033100 EST cluster (not in UniGene) 2.47 

313149 AW291092 Hs.201058 ESTs 2.47 

338325 CH22_EM^C005500.GENSCAM307.7 2.46 

307877 AI368880 EST singleton (not in UniGene) with exon 2.46 

311525 AI799444 Hs.247095 ESTs; Moderately simflar to I! II ALU SUB 2.46 

337023 CH2aj r GENES.433-12 2.46 

300916 AI361798 Hs.164675 ESTs 2.45 

302919 AL137382 EST duster (not in UniGene) with exon h 2.45 

320303 AL079289 Hs.137154 H sapiens mRNA full length insert cONA c 2.45 

318359 AI097439 Hs.135548 ESTs 2.45 

314384 AA535840 Hs.162203 ESTs; WeaJdy similar to alternatively sp 2.45 

32S763 CH.20_hsglI6598307 Z45 

319900 AW408392 EST duster (not in UniGene) 2.45 

314451 AA58636B Hs.190232 ESTs 2.45 

300641 AW237699 Hs.1 18346 ESTs 2.44 

324368 AW299374 EST duster (not in UniGene) 2.44 

336510 CH2£_FGENES334_5 2.44 

326876 CH^OJwgi|6682507 2.44 

307753 AI340509 Hs.182426 ribosornal protein S2 2.44 

317071 M78728 Hs.132694 ESTs 244 
313877 AA767869 Hs.250113 ESTs; Moderately similar to thyroid norm 

component TRAP150 [H sapiens] 2.44 

315974 AW029203 Hs.191952 ESTs 2.43 

322970 A1885052 Hs.1 42287 ESTs; Weakly similar to IIH ALU CLASS P 2.43 

317733 AI028257 Hs.132317 ESTs 2.43 

313599 AA748749 Hs, 136742 ESTs 2.42 

323014 AA305198 EST duster (not in UniGene) 2.42 

324980 AA969121 H&254296 ESTs 2.41 

301326 AA883831 H&252924 ESTs 2.41 

308695 AI763350 EST singleton (not in UniGene) with exon 2.41 

330166 CH.Q2_p2gi}6648220 2.41 

317552 AW451400 Hs.127019 ESTs 2.41 

320572 AI929508 Hs.1 59590 lymphocyte antigen 6 complex; locus H 2.41 

315618 AI287341 Hs.154029 ESTs; Weakly similar to TRANSCRIPTION FA2.41 

331610 N91109 Hs.54681 ESTs Z41 

311731 AW393528 Hjl246875 ESTs 2.41 

318571 243383 Hs.8053 ESTs Z4 

334958 CH22.FGENES.465J7 2.4 

323570 AL038623 Hs.208752 ESTs; Weakly similar to till ALU SUBFAMI 2.4 

301685 W67730 EST duster (not in UniGene) with exon h 2.4 

303849 AW163324 EST duster (not in UniGene) with exon h 2.4 

325702 CH.14_hsgq5867028 2.4 

313074 N48261 Hs.127171 ESTs 2.4 

308994 AI880051 EST singleton (not in UniGene) with exon 2.4 

330338 CH08j)2gi|5457162 2.4 

327274 CH.01_hsgq5867470 14 

325953 CH.16Jtsgl}5867140 2.4 

333281 CH22.FGENES.128J 2.4 

314778 AW079559 Hs, 152258 ESTs 239 

317005 AI800251 Hs.197773 ESTs 238 

334257 CH22.FGENES367 5 238 

324783 AAS40770 EST duster (not In UniGene) 238 

300949 AA534325 Hs.162183 ESTs 2.38 

314957 AW029274 Hs.208368 ESTs; Moderately similar to lid ALU SUB 2.38 

324350 AW292501 Hs.157174 ESTs; Weakiy similar to similar to SH3-C Z38 

338235 CHZLEMAC005500.GENSCANi60-16 238 

300937 AW297302 Hs.255631 ESTs 238 

317439 AW451327 Hs.170623 ESTs 238 

324745 AI742120 Hs.1 16506 ESTs; Weakly similar to (111 ALU SUBFAMI 238 

338306 CH22„EM:AC005500.GENSCAN.3Q2-2 238 

318765 242071 Hs^3961 ESTs 238 

310254 AI239811 Hs.157491 ESTs 2.37 

305116 AA649244 EST singleton (not in UniGene) with exon 237 

324016 AL045285 Hs.246849 ESTs; Moderately simitar to lill ALU SUB 237 

322774 AA131111 EST duster (not in UniGene) 237 

335745 CH22.FGENES.601 J 6 237 

300972 AI979100 Hs.211518 ESTs 237 

338809 CH223MC005500.GENSCAN.531-10 237 

316983 AM80204 Hs.177131 ESTs 237 



DU145,A549,A549 
NC!-H69,MCF7,BT474 
NCI-H59.NCI-H345, PRSCJog 

nci-358,nci-H23,nci-H46q 
nc*-358,nci-358,nci-h23 

BT474, RPWE-2, PRSCjan 
MCF7, MB-MDA-453, MB-MDA-435s 
EB, CALU6.A549 
NCI-H345, PRSC.con, RPWE-2 
BT474, LnCap, EB 
NC1-H23, PRSCJog, NCI4i520 
PRSCjxot, PRSCJog, NCI-H345 
OVCA-R CALU6,PRSC_con 
LnCap,DU145,CALU6 
LnCap,MB4/IDA-231,CALU6 
BT474, MB4/IDA-231, M&JUDA-453 
NCI-H460, MB^DA-453, NOH345 
OVCA-R, PC3, EB 
NCI-H69, NCI-H345, RPWE-2 
Caco2,NCI-H460, NCI-H23 
PRSC.OOn, NCI-H345, M&MDA-231 
NCI-H345, PRSCJog. PRSCjan 
PC3, DU145, OVCA-R 
NOH69, RPWE-2, PRSCjan 
NCW23,NCl-H460,NCI-H520 
NCW23,NCI-H460.Caco2 
NCI-H345, NCI-H69, RPWE-2 

DU145,LnCap,CALU6 
EB, DU145, OVCA-R 
NCI-H345, RPWE-2, EB 
CALU6, RPWE-2, OVCA-R 
NCW460,NCW58, NCW520 
PRSCjcon, NCI-H460, RPWE-2 
MCF7, OVCA-R, PC3 
PRSC_con, PRSCJog, RPWE-2 
RPWE-2, NCI-H69, NCI-H345 
CALU6,DU145,A549 
N0358.NCI-358, NCI-H23 
CALU6,HT29,A549 
OVCA-R Caco2, MB-MDA-231 
NCW23.NCW520.NCI-358 
NO-H69, NCWi345, PRSC.Con 
NCi-358,NCI-H23.NCW520 
DU145.PRSC.con, RPWE-2 
OVCA-R, EB.BT474 
MBJWDA-231,NCW345,EB 
RPWE-2, PRSCJog, NCI-H345 
NCI-H23,NCW460,NCI-H520 
MB4UDA-231, RPWE-2, PRSCJog 
RPWE-2, EB, PRSCjcon 
DU145, EB, LnCap 
OVCA-R, DU145, MB-MDA-231 
M&MDA453, MB-MDA-435S, MCF7 
NCkH23,HT29,DU145 
EB,CALU6.Caco2 
MBJWDA-231,BT474,HT29 
KT29.NCI-358, MB-MDA-231 
EB, OVCA-R, MB-MDA453 
NCI-H69,NCI-H345, PRSCJog 
LnCap,DU145,DU145 
HT29,NCI-H23,NCI-H23 
NCI-H69,NCW460,NCi-H23 
PRSCJog, PRSCjcon, PRSCjan 
A549,DU145,EB 
NCW56,NCI-H460,BT474 
NCI-H69.PRSC_con, PRSCJog 
UCap,NCI-H23,NCI-H520 
OVCA-R, DU145, EB 
CALU6, M&MJA-435S, M&MDA-453 
EB, DU145, OVCA-R 
OVCA-R EB,A549 
PRSCJog, PRSCjan, NCI-H69 
NCI-H69, NCI-H345, PRSCJog 
NCI-H23,NCI-H69,NCI-H520 
NCI-HUS.PRSCjxm, PRSCJog 
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321308 AK47480 Hs.117029 ESTs 2.37 

323578 AA299492 Hs.168166 ESTs 2.37 

335747 CH22LFGENES.6O1J0 2.36 

322362 AF039697 EST duster (not in UniGene) 2.36 

314430 N76302 Hs.78110 ESTs; Weakly similar to F17A9.2 [C.etega 2.36 

304631 AA586422 EST singleton (not in UntGene) with exon 2.36 

337432 CH2£_F6ENES.765-1 2.36 

305984 AA887654 EST singleton (not in UniGene) with exon 236 

313486 AW134523 Hs.247186 ESTs 2.36 

303028 A1BB9109 Hs.212032 EST 2.36 

318292 A1679966 Hs.150603 ESTs 2.35 

334198 CH2LFGENES.354J 2.35 

314458 A1217440 Hs.143873 ESTs 2.35 

333346 CTC2LFGENES.139J5 235 

325408 CR12_hsgl|5866921 Z35 

313758 AA076743 Hs.129770 ESTs 2.35 

309825 AW293701 EST singleton (not in UniGene) with exon 2.35 

303536 R55497 Hs.183941 ESTs; Moderately similar to H beta 58 ho 2.35 

331534 N51583 Hs.133756 EST 2.35 

325164 T16981 Hs^1963 ESTs 2.34 

327710 CH34_hs gi|5867860 2.34 

306351 AA961356 EST singleton (not in UniGene) with exon 2.34 

304968 AA614308 EST singleton (not in UniGene) with exon 2.34 

334015 CH2a.FGENES.313J 234 

318315 AI091370 Hs.134852 ESTs 233 

306809 A1057134 EST singleton (not in UniGene) with exon 233 

337697 CH22LEMAC000097.GENSCAN.66-1 233 

329630 CH.11_p2gi|6729060 233 

326577 CH.19_hsgi|5867317 233 

333428 CH22^FGENES.149J 233 

301080 AM79391 Hs.155405 ESTs; Weakly simflar to Hi! ALU SUBFAMI 233 

324629 AA714311 EST duster (not in UniGene) 233 

302776 AJ133798 EST duster (not in UniGene) with exon h 232 

325801 CH.14_hsgil6552451 232 

332122 AA609698 Hs.1 12389 ESTs 232 

314167 AA243633 Hs.208983 ESTs 232 

324023 AA669615 Hs214226 ESTs 231 

320503 NM_00589 EST cluster (not in UniGene) 231 

312217 T98289 EST cluster (not in UniGene) 231 

321304 AA078293 EST cluster (not in UniGene) 231 

323517 AA527359 Hs.154366 ESTs 231 

336455 CH22_FGENES.829_13 2.31 

313352 AW292127 Hs.144758 ESTs 231 

331457 H93135 Hs.41840 ESTs 231 

333054 CH22_FGENES.73_8 231 

308598 AI719237 EST singleton (not in UniGene) with exon 231 

327059 CR21_hsgiI6531985 23 

334120 CH22.FGENES.333J 23 

324154 AI457449 Hs.192817 ESTs 23 

326509 CH.19Jisgi|6682496 23 

316855 AW291384 Hs.254974 ESTs 23 

337918 CH22_EM^C005500.GENSCAN.6M 23 

317471 AI825351 Hs.144084 ESTs 229 

331023 N32599 Hs.5856 ESTs 229 

332231 N48008 Hs.102629 EST 229 

309912 AW339671 EST singleton (not in UniGene) with exon 229 

316427 AI241019 Hs.145644 ESTs 229 

313329 AW293704 Hs.122658 ESTs 229 

335019 CH2U=GENES.474J 2J29 

324394 F20654 Hs.152128 ESTs; Moderately similar to (Hi ALU SUB 229 

339357 CH22-BA354112GENSCAN.31.2 2.29 

322128 AJ346033 EST cluster (not In UniGene) 228 

301310 AI239457 Hs.1 30794 ESTs 228 

300623 AI929130 Hs.1 18261 ESTs; Moderately similar to finger prate 2.28 

323409 AL135534 EST cluster (not in UniGene) 2.27 

308406 AI634885 EST singleton (not in UniGene) with exon 227 

322518 AI133446 EST cluster (not in UniGene) 227 

338381 CH22JM^C005500.GENSCAN.330-10 2,27 

316003 AA704584 Hs.1 19993 ESTs 227 

307090 Al 161024 EST singleton (not in UniGene) with exon 2.27 

300356 AA758411 Hs.1 21 335 ESTs 227 

331887 AA431328 Hs.98660 ESTs 227 

330951 H02566 Hs.191268 H sapiens mRNA; cDNA DKFZp434N174 (from 

305547 AA773111 EST singleton (not In UniGene) with exon 227 

312457 AA776743 Hs.191589 ESTs 226 

333929 CH22JGENES.300J 226 



BT474 ( NCW69,HT29 

LnCap, EB, MB-MDA-453 

NCI-H69, LnCap, PRSCjxxi 

DU145,PRSCj»n,NCI-H345 

DU145,MB4tfDA453,CALU6 

NCW23,NCI-H460,CALU6 

MB-MDA-231,BT474,HT29 

DU145,HT29,CALU6 

DU145,A549,CALU6 

NCWSB.NOHSM, NOH23 

NCm460,Caco2NCI-H23 

NCMH69 r PRSCJog, PRSC.con 

Caco2,A549,PC3 

CALU6, DU145, LnCap 

NCW460,NCI-H520,NCI-H23 

NCI-H23, M&WDA-435S, NCI-H345 

NCl-H460,NC^H23 ( NCi-H520 

DU145.CALU6.NCW520 

NCI-H23, NCkH520 ( NCI-358 

NCW45, PRSCJog, NCI-H460 

BT474,MB4tfDA-231 > NCI-H345 

BT474, MB-MDA-231, MB-MDA-435S 

CALU6, KT29,MB^DA453 

rH29, MB-MDA-231, BT474 

CALU6,NCM520,DU145 

PC3.DU145.EB 

RPWE-2, PRSCJog, NOH345 

NO-H520.NCI-H23.NCW460 

NCI-H460, NCI-358, NCI-H23 

NC1-H345, PRSC.con, RPWE-2 

OVCA-R,MCF7,MCF7 

NCW460, NCI-358, NCI-H23 

NCi-H23,NCi-H460, NCW520 

PRSCJog, MCF7.NCI-H23 

DU145.HT29.PC3 

DU145.MCF7.PC3 

DU145,NCI-H345,EB 

A549.OVCA-R.PC3 

NCW23,Caco2,NCW69 

DU145,OVCA-R,EB 

NCW345,DU145,EB 

NCI-H345, PRSCjxm, RPWE-2 

MCF7,DU145,OVCA-R 

Caco2,NC!-H460,NCi-H23 

NCI-H69, NCI-358, NCI-H23 

OVCA-R,CALU6,Caco2 

NCI-W60,UiCap, LnCap 

NCI-H69, RPWE-2, MB-MDA-435S 

NC1-H460, MB-MDA453, NCI-358 

NCI-H345,CALU6,OVCA-R 

NCi-H345,NCW*460,BT474 

RPWE-2 NCI-H345, PRSCJog 

KT29,OVCA-R,DU145 

OVCA-R, LnCap, A549 

CALU6, DU145, EB 

MB-MDA435S, PRSC.con, NCI-358 

Caco2, HT29, EB 

OVCA-R, DU145,Caco2 

HT29,CALU6,MB^DA-231 

NCI-H345, MB-MDA-231, RPWE-2 

NCW69, OVCA-R, BT474 

NCI-H23, NO-H520.NCI-H460 

OVCA-R, DU145, MB-MDA-231 

BT474, RPWE-2 PRSC.con 

NCI-H345, NCI-358, Caco2 

OVCM*,EB,H729 

DU145, MB-MDA-435S, OVCA-R 

NCI-H69, PRSC.con. PRSCJog 

NCI-358, NCI-H520, NCI-H23 

NCI-H345, DU145, RPWE-2 

LnCap, NCI-H460,Caco2 

NCI-358. NCI-H520.CALU6 

227 OVCA-R BT474.BT474 

LnCap, DU145,BT474 

NCI-H345, RPWE-2, PRSC.con 

HT29,CALU6,EB 
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319845 
306739 
306919 
333312 
334956 
312295 
307643 
324252 
309767 
311492 
312260 
327125 
316919 
316361 
315772 
320236 
315444 
333903 
335234 
333727 
332002 
329611 
310559 
327315 
323170 
331522 
313261 
312740 
325055 
337895 
307140 
321643 
302663 
322644 
330415 
302334 
326710 
323561 
337706 
339309 
330436 
312360 
301855 
331192 
315872 
337904 
308258 
320965 
333910 
300707 
336011 
325712 
322738 
335339 
320733 
319412 
337132 
301544 



AA649011 
A1028393 
AI096832 



Hs.187902 



AA578233 Hs.173863 
AI302124 
AA421989 
AW271805 
AW410240 
H05392 



l Hs.4437 
Hs.230597 



AAB45382 
AW33833 
AW515373 



Hs.204520 
Hs.164159 
Hs.158893 



AW138821 H&221737 



AA482009 
A1783594 
U83527 



AA730472 

R97191 

Z44631 

AI185762 

W76005 

X85153 

AA340904 

D83777 

AF120491 

AA825426 



Hs.105104 
Hs.155718 



Hs.117012 
Hs.142805 
Hs.134106 
Hs.21658 



Hs.32094 
Hs.75137 
Hs.238832 



ESTs 

EST singleton (not in UnlGene) with exon 

EST singleton (not in UnlGene) with exon 

CH22.FGENES.138J 

CH22_FGENES.465.24 

ESTs 

EST singleton (not in UnlGene) with exon 

ESTduster(notinUruGene) 

EST singleton (not in UnlGene) with exon 

ribosoma) protein L28 

EST 

CH.21Jisgi|6531971 
ESTs 

ESTs; Weakly similar to lili ALU SUBFAMI 
ESTs 

EST cluster (not in UnlGene) 
ESTs 

CH22.FGENES794J 
CH2*JGENES.515_3 
CH22_FGENES756_1 
ESTs 

CH.10_p2gi|3962478 
ESTs 

CH.01_hsgi|5867508 

EST dusterjnot in UniGene) 

ESTs 

ESTs 

ESTs 

ESTs 

CH22_E(AAC005500,GENSCAN.56.2 
EST singleton (not in UnlGene) with exon 
ESTs 

EST duster (not in UniGene) with exon h 
EST duster (not in UniGene) 
KIAA0193gene product 
EST duster (not in UnfGene) with exon h 



HG2724^H 

AI922972 Hs.196073 
AF053356 
T55182 
AW051819 

AI565612 
H16166 



Hs.152571 
l Hs704516 



AA080921 



AF201832 

AA738436 
AA679426 



Hs.134407 
Hs.187505 



AJ951651 Hs724290 



338280 
311421 
330638 
326603 
319055 
335451 
317989 
322024 
300734 
304022 
330082 
312516 
333932 
308115 
320184 
324432 
320882 



AI701635 
X89576 

AA412305 

AI203009 
AA334384 
AW205197 
T02990 



Hs.207077 
Hs.159581 



Hs.130664 
Hs740951 



AA363245 Hs.189831 
AI479071 

U91510 Hs.123036 

AA464510 

AI832098 



ESTs; Weakly simitar to liil ALU SUBFAMI 
CH2?_EMAC000097.GENSCAN^7-11 
CH22_BA354M2.GENSCAN.22-7 
Oncogene Tls/Chop, Fusion Activated 
ESTs 

multiple UniGene matches 

ESTs; Highly simflarto IGF-il mRNA-bmd 

ESTs 

CH22_EI*AC005500.GENSCAN^6-17 

EST singleton (not in UniGene) with exon 

EST cluster (not in UniGene) 

CH2*/GENES795_3 

EST cluster (not in UniGene) with exon h 

CH2LFGENES.668.9 

CH.14_hsgil6682473 

EST cluster (not in UniGene) 

CK22.FGENES.535J6 

ESTs 

ESTs 

CH22.FGENES.526-3 
ESTs 

CH.11_hsgl|5866903 
CH2ZJMACQ0550UGENSCM790-1 1 
ESTs 

matrix metaDoproteinase 17 (membrane-ln 

CH70.hsgI|6056312 

EST duster (not in UniGene) 

CH22_FGENES562_9 

ESTs 

EST duster (not in UnlGene) 
ESTs 

EST singleton (not in UniGene) with exon 

CH.19j>2gpi5314 

ESTs 

CH22_FGENES.300_5 

EST singleton (not in UniGene) with exon 

CD39-fte1 

EST duster (not in UnlGene) 
EST duster (not in UniGene) 
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1 n^nwi HI M AC U^ry 

LnCap, DU145, MCF7 


276 


km imt XQC Mf*l ICQ ^ Al 1 >C 

MB-MDA-43DS, NUhJOO, GALUb 


*> ntt 
Z2o 




276 


QVCA-R, DU145, PC3 


225 


RPWt-Z, rrtoC.COn, NCI-no40 


275 


OVCA-K, DU 145, NCI-H345 


2.25 


CALUo, CALUo, OVGA-R 


2.25 


nwni n CD AC Aft 

OVCA-R, EB, A549 


2.25 


DU145, NCI-H460, CALU6 


2.25 


NCI-H69, NCI-H460, NCI-H520 


2.25 


Caco2, EB, DU145 


2.25 


HT29, NCW58, BT474 


2.24 


NCI-H23, NCI-H345, NC1-H520 


2.24 


DU145, EB, PC3 


2.24 


OVCA-R, EB, LnCap 


2.24 


i|A| a P . KIM IF 1MI 1 KM 

NCI-358, DU145, NCI-H23 


274 


NCI-358, CALU6, PRSC_oon 


2.24 


MB-MDA-231, BT474, A549 


274 


n^i Lion ODor» ««. nooo i M 

NCJ-H69, PRSC.con, PRSCJog 


273 


MB-MDA-231, NCI-H69, BT474 


2.23 


EB, NCI-H520, HT29 


273 


BT474, HT29, MB-MDA-231 


272 


BT474, MCF7, MB-MDA-231 


272 


NCI-H69, EB, EB 


272 


EB, DU145, LnCap 


272 


A549, LnCap, DU145 


? 7? 


OVCA-R, PC3, LnCap 


2.22 


BT474, MCF7, OVCA-R 


2.22 


MB-MDM53, DU145, CALU6 


272 


NCI-H345, PRSCJog, PRSC.con 


272 


NCI-H520, NCMH460, EB 


271 


EB, NCI-H345, PRSC.con 


271 


BT474, MB-MDA-231, MCF7 


271 


NCI-H460, NCI-H23, NCI-H52U 


271 


Ail IM . r Af\ /"v ^ /\ 

CALU6,A549,Caco2 


271 


iiAi i ma biai i ia jr m^N^ 

NCm69, NCI-H345, PC3 


271 


NCI-H520, NCI-358, NCI-H23 


271 


NCW345, DU145, NCI-H69 


271 


MB-MDA435S, NCI-358, NCI-H520 


271 


BT474, HT29, PC3 


271 


PRSC.con, NCI-H69, Caco2 


271 


OVCA-R, MB-MDA435s, DU145 


27 


NCI-H69, HT29, NCI-H23 


27 


OVCA-R, PC3, CALU6 


27 


LnCap, OVCA-R, EB 


27 


OVCA-R, UCap, EB 


27 


DU145, M&MDA-231, CALU6 


27 


DU145, EB, LnCap 


27 


DU145, MB-MDA-231, EB 


27 


BT474,MCF7,HT29 


2.19 


NCI-H460, BT474, NCI-H345 


2.19 


NCI-H460, NCI-H23, NCI-358 


2.19 


PC3, RPWE-2, PRSC.con 


£19 


HT29, PRSCJog, MCF7 


2.19 


DU145, EB, Caco2 


2.19 


NCI-H345, PRSCJog, PRSC_con 


2.19 


NCI-H69, NCI-H345, PRSC.con 


2.19 


PRSC.con, MB-MDA-231, NC1-H23 


2.18 


PRSC.con, PRSCJog, MB-MDA-231 


2.18 


PC3, NCI-358, HT29 


2.18 


RPWE-2, NCI-H345, NCI-358 


2.18 


HT29, MB-MDA-435S, MB-MDA-453 


2.18 


CALU6, DU145, KT29 


2.18 


A549, OVCA-R, MB-MDA-435s 


2.18 


m HA*. 1 nCstn TAt 1 Ifi 
UU143, UIOop, UnLUO 


2.18 


NCi-H345,NCI-H69,NCm520 


2.18 


Caco2,PC3,NCI-H520 


£18 


NCI^58,A549,EB 


2.18 


NC1-H23,NCW58,NCI-H460 


£16 


NCl-H23,Caco2, Caco2 


£18 


BT474,HT29,MB4i4DA-231 


£17 


PC3,Caco£EB 


£17 


BT474, OVCA-R, OVCA-R 


£17 


NCW520, NCI-358, NCI-H23 


£17 


CALU6, RPWE-2, HT29 


£17 


OVCA-R, PC3.BT474 
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312251 H03952 EST duster (not in UnlGene) 2.17 

315049 AW340486 Hs.121210 ESTs 2.17 
305018 AA6271Z7 EST singleton (not in UntGene) with exon 2.17 

303807 AI792785 Hs.130434 ESTs 2.16 

317792 AI653389 Hs.196121 ESTs 116 

321666 AA872730 Hs.125229 ESTs 2.16 

328863 CR07_hsgq6381929 2.16 

319373 R00371 EST cluster (not in UnlGene) £16 

320069 T86541 Hs.189732 ESTs 2.16 

320235 AF064OS0 Hs.1 29708 tumor necrosis factor (Bgand) superfami 2-16 

338880 CH22JDJ32J10.GENSCAN.W 116 

318314 AI091349 Hs.161133 ESTs 2.16 
332696 D86973 Ms.75354 GCN1 (general control of amino-acid synt 2.16 

331352 AA406133 Hs.7482 WAA0682 gene product 2.16 

339019 CH22J)A59H18.GENSCAN.21-15 2.15 

306975 AI127042 EST singleton (not in UniGene) with exon 115 

318069 AI024557 Hs.131540 ESTs 115 

312997 AW205686 Hs.135130 ESTs 2.15 

331372 AA433935 Hs.55044 DKFZP586H2123 protein 2.15 

335049 CH2i.FGENES.481J 2.15 

324280 AA429772 Hs.191610 ESTs 2.15 

330363 CHXp2giJ3126882 2.15 

322896 AW470296 Hs. 144830 ESTs 115 

321981 AA946204 Hs.127361 ESTs 2.15 

333294 CH22LFGENES.130_6 2.14 

330170 CH.02_p2giI6648220 2.14 

312973 AI123346 Hs.135241 ESTs 2.14 

311104 AI627352 Hs.201449 ESTs 2.14 

325086 T10019 Hs.4194 ESTs 2.14 

317182 AW183524 Hs.192298 ESTs 2.14 

323644 AA310711 Hs.124340 ESTs 2.14 
308092 AI474896 EST singleton (not in UniGene) with exon 2.14 

322265 AF0B6244 EST cluster (not in UnlGene) 2.14 

303521 AA746272 EST cluster (not In UnlGene) wflh exon h 2.14 

312102 AW439340 Hs.189720 ESTs 2.14 

316559 AI24946B Hs.228251 EST 2.14 

338486 CH22LEMAC005500.GENSCAN.382^ 2.14 

301302 AI825444 Hs.210956 ESTs 2.14 

310591 AI650372 Hs.195979 ESTs 2.14 

316231 AA732301 EST cluster (not in UniGene) 2.14 

326559 CH.19J*g^5867310 2.14 

324062 AA525291 Hs.204099 ESTs; Weakly similar to !!il ALU SUBFAMI 2.13 

323844 AI811303 Hs.143490 ESTs 2.13 

333895 CH22.FGENES.293J 2.13 

308264 A1567114 Hs.171454 EST 2.13 
306081 AA908472 EST singleton (not in UniGene) with exon 2.13 

333101 CH22LFGENES.79L6 2.13 

328544 CH.07_hsgl)5868486 2.13 

333355 CH22JGENES.141.6 2.13 

323397 AI524519 Hs. 239 599 ESTs 2.13 
305697 AA814956 EST singleton (not in UniGene) with exon 2.13 

327809 CH.Q5JtsgiI5867968 2.13 

325092 T10115 HsS2423 ESTs 2.13 

322299 AI971935 Hs252784 ESTs 113 

312145 AA029526 Hs.1 2570 6 ESTs 2,12 

323704 AA319421 Hs.193577 ESTs 112 

328971 CH.08J*gl|64788Q8 2.12 

325338 CH.11Jisgi|5e66883 2.12 

331332 AA282554 Hs.89034 ESTs 2.12 

327159 CH.01_hsgipa67550 112 

335180 CH2^_FGENES.505J 112 

338062 CH22.EMAC005500.GENSCAN.162-3 112 

318350 AI636018 Hs.135538 ESTs 112 

312070 AW293140 Hs.108790 ESTs 111 

328314 CH.07Jisgi|5868371 111 

315869 AI033547 Hs.132826 ESTs 111 

339246 CH2?_BA354I11GENSCAN.54 111 

329921 CH.16_p2gll6165205 111 

324981 Z25333 Hs.4947 ESTs 111 

331291 AA159323 Hs.109929 ESTs 111 

332729 AA058907 Hs.83190 fatly acid synthase 2.11 

325448 CRISIS gi|5866941 111 

314929 AW188286 Hs.143612 ESTs 11 

301063 AI057634 Hs.124596 ESTs 11 

301952 AB029016 Hs.117333 KIAA1093 protein 11 

326309 CR17Jisgl|5867277 11 



NCW460,NCI-H23,NCW58 

NOH520, NCI-358, NCI-H23 

MB-MDA-231, MB-MDA-453, EB 

NO-H345, PRSCcon, PRSCJog 

NCl-H345 ( PRSCj»n,UCap 

OVCA-R, PC3, MCF7 

PRSC_con,NCI-H345 I NCI+l460 

PRSCjxwi, RPWE-2, NCI-H345 

NCI-H23, NCt-358, NCI-H345 

NaH23,NCI-H460,NCI-H520 

BT474.MCF7, OVCA-R 

NCI-H23,NCI-H520,NCI-H460 

A549.PC3.DU145 

PC3, EB, MB-MDA-231 

UCap, EB, OVCA-R 

MB-MDA-435S, NCI-H520, NCW58 

Caco2, Caoo2, BT474 

NCW460,Na-H23,NCU58 

PRSCjXJn,HT29,CALU6 

NCW69.NCI-H345, PRSCJog 

MB-MDM53, MB-MDA-4358, MCF7 

NCI4^23.NCWW60,NCW58 

HT29, CALU6, EB 

M&MDA-231,DU145,HT29 

EB.DU145, MB-MDA-453 

HT29,MBJUDA453,PC3 

LnCap f DU145,EB 

NCW520,NCJ-H23,LnCap 

NCI-H460,NCI-H23, NCI-358 

HT29,BT474,MB-MDA-435s 

RPWE-2, PRSCjan, PRSCJog 

BT474.MCF7, MB-MDA-231 

NCI-H345, RPWE-2, PRSC_con 

DU145,M&MDA453,EB 

NCI-H23, NCI-H460, MB-MDA435S 

NCI-H460,NCW58 ( NCI-H23 

NCW520.NCI-H23,NCI-H69 

BT474.HT29.M&MDA-231 

CALU6, CALU6, Caco2 

NCW23,NCl-H520,NCU358 

DU145.NCI-H460.NCI-H23 

OVCA-R DU145.EB 

MB-MDA-453, MCF7, MB-MDA435S 

CALU6, UCap,DU145 

DU145.CALU6, MB-MDA-453 

HT29.BT474, MB-MDA-231 

NCI-H345, NCI-H69, PRSCJog 

NCJ-H23, NCI-H69, PRSCJog 

DU145,EB,CALU6 

EB, NO-H460.NCI-H345 

NOH520.NCI-H460, NCI-358 

HT29.PC3, OVCA-R 

HT29. NCI-358, MB-MDA-231 

PRSC_con ( DU145 l DU145 

OVCA-R A549, MB-MDA-435S 

CacQ2, LnCap, OVCA-R 

NCI-358, NO-H23.NC1-H520 

LnCap,NCW69,NCi-H345 

NCI-H520 f NCM423 J Caco2 

EB,DU145,PC3 

LnCap,NCI-H69,A549 

PRSC con, PRSCJog, NCI-H69 

EB.HT29,DU145 

Caco2 r NCI-H23, A549 

HT29,NCl-H23.NCI-H460 

BT474,CALU6,MCF7 

CALU6,CALU6,BT474 

BT474, MB-MDA-231, HT29 

A549,NCI-H460,NCI-H520 

NCWW ( A549,PRSC_con 

NCI-358. LnCap.MB^I/IDA-453 

DU145, MCF7, CALU6 

EB, BT474, MB-MDA-231 

NCW23,NCI-H460,BT474 

OVCA-R, A549, CALU6 

MB-MDA-435S, NCI-H69, M&MDA453 
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315406 AJ823453 Hs.146625 ESTs 2.1 

302376 AB007867 Hs.200480 WAA0407 protein 2.1 

312181 AA417281 Hs.191595 ESTs 2.1 

334254 CH22_FGENES.366_4 21 

318073 AW167Q87 Hs.131562 ESTs 21 

304724 AA569881 Hs.65114 keratin 18 2.1 

332359 W87704 Hs.211558 ESTs 2.1 

331884 AA431302 Hs.98721 EST ; Weakly similar to N-coplne [H^apie 2.1 

308228 AI559106 H&181165 eukaryotic translation elongation factor 2.1 

324279 AA501412 Hs.191688 ESTs; Weakly similar to Pro-PoWUTPase 2.09 

3372)3 CH22L.FGENES.591-3 2.09 

322346 AA227618 Hs.10882 HMG-box containing protein 1 2.09 

304470 AA426654 Hs.195168 gr^raldehyd&^hosphatedBhyoVogenase 2.09 

325977 CH.16_hs gi|6249602 2.09 

304696 AA554758 EST singleton (not in UniGene) with exon 2.09 

317412 AI301528 Hs.132604 ESTs 2.09 

315570 AI860360 Hs.160316 ESTs 208 

327341 CKOIJis gpi7016 2.08 

327431 CH.02Jisgi|5867754 2.08 

314685 AI870811 Hs.158709 ESTs; Weakly similar to KIAA0938 protein 2.08 

328624 CH.07_hsgij5868246 2.08 

303596 AW303377 EST duster (not in UniGene) with exon h 2.08 

336717 CH22.FGENES.81-1 2,08 

317370 AW204139 Hs.174424 ESTs; Weakly simflar tap140mDia[M.musc 208 

331287 AA149061 Hs.172971 ESTs 208 

304211 N62228 EST singleton (not m UniGene) with exon 2.08 

315613 AW137420 Hs.192311 ESTs 2.08 

325636 CH.14_hsglj5867002 2Q8 

336406 CH22_FGEhES^23J1 208 

301714 F06529 EST duster (not In UniGene) with exon h 2.08 

300496 R45159 Hs.221804 ESTs 108 

318970 R21114 Hs21383 ESTs 208 

334115 CH22LFGENES.330J5 208 

308082 AI473682 EST singleton (not in UniGene) with exon 208 

308282 AI569456 EST singleton (not tn UniGene) with exon 2.08 

313038 AW451618 Hs.124195 ESTs 207 

317974 AW444468 Hs.144900 ESTs 207 

324063 AW292740 Hs.254815 ESTs 2.07 

334759 CH22_FGENES.428_8 207 

307864 AI367417 EST singleton (not in UniGene) with exon 2.07 

304356 AA196027 Hs.195168 glyceraldehyde4-phosphate dehydrogenase 207 

303929 AW470753 EST singleton (not in UniGene) with exon 2.07 

331857 AA421160 Hs.9456 SWI/SNF related; matrix assocd; acrJn de 207 

322814 AI824495 Hs.211038 ESTs 206 

303650 AA430709 EST duster (not in UniGene) with exon h 206 

333403 CH22_FGENES.144^21 206 

313663 AI953261 Hs.169813 ESTs 206 

338594 CH22JMAC005500.GENSCAN.4354 206 

334676 CH22_FGENES.418.29 206 

310046 AI198032 Hs.210356 ESTs 2.06 

309169 AI949216 EST singleton (not in UniGene) with exon 206 

329752 CH.14_p2gi|6065777 206 

325085 T10001 Hs.4168 ESTs 206 

332062 AA521016 Hs.185375 ESTs 206 

302074 AA382871 Hs.132794 phosphate cytidylyltransferase 1 ; cholln 206 

326344 CH.17_hs giJ6525295 206 

330855 AA079318 zm98c2s1 Stratagene colon HT29 (#937221 

IMAGE545954 3', mRNA seq 206 

302525 AFO24690 Hs.248056 G protein-coupled receptor 43 205 

331903 AA436673 Hs.29417 H sapiens mRNA; cONA DKFZp586B0323 (from 

316322 AW296618 Hs.120637 ESTs 205 

321525 H78875 EST duster (not in UniGene) 205 

305071 AA640579 EST singleton (not in UniGene) with exon 205 

326033 CH.17Jisgi|5867178 205 

334730 CH22.FGENES.424.5 205 

305335 AA704235 EST singleton (not in UniGene) with exon 205 

320521 N31464 Hs.24743 ESTs 2.05 

333515 CH22_FGENES.172_5 2.04 

311020 AI918672 Hs.213783 ESTs 204 

324323 AA393739 EST cluster (not in UniGene) 204 

305486 AA748889 EST singleton (not in UniGene) with exon 204 

312162 T91823 EST duster (not In UniGene) 204 

330980 H28794 Hs.6659 ESTs 204 

317463 AA927290 Hs.130462 ESTs 204 

303460 AA700155 Hs.1 17900 ESTs 204 

337435 CH2*_FGENES.766-2 203 



OVCA-R DU145.EB 

OVCA-R.Caco2.HT29 

OVCA-R A549.DU145 

LnCap, OVCA-R DU145 

A549.CALU6.EB 

NaH23,NCI-H520,NCI-H460 

MB-MDA-435S, PRSC.con, NCI-H460 

NCUH345, MB-MDA-231. PRSC.con 

EB, CALU6. OVCA-R 

OVCA-R LnCap, PC3 

NOH69, NCI-H345, MB-MDA-231 

HT29 ( BT474.MBM)A-231 

NCW23,CALJU6 ( NCI^520 

NCm23.NCW520.HT29 

MB*DA435s,NCI-H23,BT474 

Caco2, EB, NCI-35B 

PRSC.con, PRSCJog, NOH345 

MB4/IDA-231, PRSC.con, NCW69 

NCW23,NCW58,NCW520 

MB-MDA-453, MCF7, OVCA-R 

MCF7, NCW58, RPWE-2 

RPWE-2, PRSC.con, PRSCJog 

BT474.HT29.MCF7 

NCI-H23,NCI-H460 I NCl-H69 

OVCA-R, EB, NCI-H345 

BT474, MCF7, MB-MDA-231 

PRSC.con, PRSCJog, PRSCJog 

NC1-358,NCI-H460,MB-MDA453 

HT29,EB,OU145 

LnCap, PRSCJog, PRSC.con 

PRSC.con.UCap, RPWE-2 

NCW23,NCW520,NO-H460 

BT474.NCm69.HT29 

MB^DA435s. NCI-H345, MB-MDA-231 

LnCap ( EB, PRSC.con 

NCI-H345. PRSC.con, LnCap 

NCI-358,NCI-H23,NCW520 

Caco2,NCI-358,NCI-H520 

CALU6, HT29.NCW520 

NOH460,NCi-358,NCI-H23 

HT29,MCF7,M&M5A-435s 

NCW345, PRSC.con, RPWE-2 

EB.A549.PC3 

PRSC.con, RPWE-2, Caco2 

RPWE-2. NCI-H345, PRSC.con 

OVCA-R, CALU6.PC3 

NCI-H345, OVCA-R NCI-H23 

OU145, LnCap. EB 

NCI-H69, PRSCtog, PRSC.con 

MB-MDA-435S, NCI-H23, Caco2 

CAUU6.EB.NCI-358 

CAUU6,HT29,DU145 

EB. OVCA-R. rv©-MDA-435s 

OVGA-R, MB-MDA-453, MCF7 

LnCap, EB.NC1-H69 

HT29,BT474, MB-MDA-453 

RPWE-2, LnCap, PRSC.con 

NCU358.NCI-H23,DU145 

205 Caco2,DU145,A549 

BT474, MB-MDA-453, OVCA-R 

NCI-H23, PRSC.con, NCI-H520 

M&MDA-231,BT474,HT29 

HT29.DU145.BT474 

BT474.EB, OVCA-R 

MCF7, OVCA-R, MB-MDA-453 

M&M)A^53,MBM)A-231,PC3 

NC1-H345, RPWE-2. PRSC.con 

NCI-H460.NCW23.NCI-H520 

OVCA-R, PC3, LnCap 

NCI-H345, PRSCtog, CALU6 

NCI-H520. NCW23. NCI-358 

MCF7, MB4/IDA-453, MBWA435s 

NCm23,Caco2,NCI-H69 

DU145, EB, CALU6 

NCW345, OVCA-R LnCap 
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305484 AA742425 


EST singleton (not in UniGene) with exon 


2.03 


CALU6, NCh1520, NCY-358 


307918 AI383496 


EST singleton (not In UniGene) with exon 


2.03 


NCm23 ( BT474, MB-MDA-231 


322209 HB9360 


EST cluster (not in UniGene) 


Z03 


DU145, OVCA-R, MB-MDA-453 


310295 AW205198 Hs.149146 ESTs 


2.03 


NCI-H23, NCI-H460, NC W58 


325886 


. CH.16J*gtp867087 


2.03 


NCJ-H345, NCI-H345, RPWE-2 


329719 


CH.14_p2gP65785 


2.03 


NCW69, RPWE-2, PRSC.con 


309247 AI972768 


EST singleton (not in UniGene) with exon 


2.03 


LnCap, PRSC_oon, RPWE-2 


328277 


CH.07_hsgi|600447t 


2.03 


LnCap, RPWE-2, A549 


307296 AI205705 


Hs.147222 EST 


2.03 


NCI-H460, NCI-358, NC1-H23 


327203 


CH.01Jsgi|5867447 


2.03 


HT29, BT474, MB-MDA-231 


306866 AI086683 


EST singleton (not in UniGene) with exon 


2.03 


BT474, NCWH345, HT29 


333339 


CH22J=GENES.139J 


2.03 


HT29, DU145, CALU6 


323115 AI921875 


EST duster (not in UniGene) 


2.03 


BT474, BT474, MB-MDA-231 


304811 AA584361 


EST singleton (not in UniGene) with exon 


2.03 


NCI-H23, NCt-358, NCI-H460 


323372 AL135125 


Hs.13913 ESTs 


2.02 


DU145,EB t A549 


312854 AA828713 


EST duster (not in UniGene) 


2.02 


NCI-H345, PRSC.con, PRSCJog 


307904 AJ381019 


EST singleton (not in UniGene) with exon 


2.02 


HT29, MCF7, MB-MDA-453 


332099 AA60B983 


at5d4.s1 Scares JesfeJlHT H sapiens cDN 2.02 


PRSC.con, NCI-H345, RPWE-2 


324634 AI684571 


Hs.175831 ESTs 


2.02 


NCI-H460, Caco2, NCI-358 


335721 


CH22.FGENES599J4 


2.02 


NCI-H69, PRSCJog, NCI-H345 


312452 AI692643 


Hs.172749 ESTs 


2.02 


HT29, Caco2, MB-MDA-231 


325396 


CH.1*_hsgIj5866921 


2.01 


HT29, NCI-H520, NCI-H460 


328770 


CH.07Jsgi|6017031 


2.01 


NCI-H23, NCI-H460, NCI-358 


335585 


CH22L.FGBfcS.5BU4 


2.01 


M&M)Ar453, DU145, MCF7 


335634 


CH22LFGENES£84 14 


2.01 


NCI-H23, NCI-H460, NCI-H69 


338271 


CH22LEM^C005500.GENSCAN^87.1 


2.01 


MCF7,DU145,PC3 


328607 


CH.07 hsgl|5868233 


2.01 


NCI-H460, NCI-H23, NCI-358 


307050 AI147341 


Hs.146734 EST " 


2.01 


NCm520 ( NCW23,NCi-358 


334946 


CH22_FGBtfES.465_13 


2.01 


CALU6.BT474.DU145 


319793 R56360 


EST duster(not in UniGene) 


2.01 


uni mien utoo mm oca 


307223 AI193698 


Hs.184776 riboscmaJ protein L23a 


2.01 


NO358,NChH520,NCI-H23 


312627 AA344698 


Hs.133169 ESTs 


2.01 


PC3, LnCap, MB-MDA-231 


329221 


CHJOisgi}5868727 


Z01 


NCI-H345,NCI-H69,NCW58 


305145 AA653589 


EST singleton (not in UniGene) with exon 


2.01 


LnCap, EB.OVCA* 


328428 


CH.07Jisgil5B68417 


2.01 


NCi-HS9,MB-MDA453,BT474 


305990 AA868866 


Hs.125919 EST 


2.01 


NOH520,NCW58,NCI-H23 


319368 R00003 


Hs.133171 ESTs 


2 


OVCA-R, LnCap, PC3 


324805 AA9270Q2 


Hs.131350 ESTs 


2 


NCUW60,NCM23,NCI-35B 


301138 AA719179 


Hs.189419 ESTs - 
EST singleton (not in UniGene) with exon 


2 


NCW69,NOH23, PRSC.con 


304675 AA541740 


2 


NCW460, NCW520, MB-MDA-231 


325194 


CH.17J»gl|5867213 


2 


HT29, NCI-358, BT474 
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Table 5: H chip - B survivor vs Met query up in Mets 



Pksey: Unique Eos probeset identifier number 

ExAccn* Exemplsr Accession number) Goihsnlc secession Dumber 

UnigenetD: TJnigeoe number 

Unigene Title: Unigene gene title 



Pkey ExAccn UniGJD Complete Jffle Ratio Met/B surv. 

102193 U20758 Hs.313 secreted phosphoproteln 1 (osteopontln; 5.56 

128530 AA504343 Hs.183475 Homo sapiens done 25081 mRNA sequence 4.62 

129093 AA2G2710 Hs.108614 KIAA0527 protein 4.23 

124690 R05818 Hs.173830 ESTs 3.96 

115558 AA393806 Hs.1010 regulator of mitotic spindle assembly 1 3.39 

134261 AA227678 Hs.8084 Humn DNA sequence from clone 465N24 on c3.22 

104792 AA029288 Ks.29147 ESTs; Highly similar to ZINC FINGER PROT 3.17 

133770 M69197 Hs.242279 haptogtobltwelated protein 3.07 
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Table 6: H chip - B survivor vs Met query - down in Mets 



Pkey: Unique Eos probcset identifier number 

ExAccn: Exemplar Accession number, Genbanlc accession number 

UoigenelD: Unigene number 

Unigene Tide: Unigene gene title 



Pkey ExAccn UnlQJD CompieteJUle Ratio Met/Bsurv. 



100116 DO0B54 Hs.77443 actin; gamma 2; smooth muscle; enteric 0.07 

101923 S75256 HNL=neutrophB PpocaDn [human, ovarian 02 

129982 M87789 Hs.140 immunoglobufin gamma 3 (Gm marker) 02 

130064 T67053 Ha.181 125 immunoglobulin lambda gene cluster 0.2 
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Table 7: I chip - B survivor vs Met query - up in Mets 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigcnelD: Umgene number 

Unigene Tide: Unigene gene titie 



Pkey EiLAccn UnlGJD Tftle 



319379 
321920 
324302 
314522 
331433 
324643 
332471 
314915 
321354 
322309 
325100 
314071 
315178 
330987 



T91443 

N63915 

AA543008 

AI732331 

H68097 

AI436356 

AA416967 

AA573072 

AA078493 

AF086372 

T10265 

AA192455 

AW362945 



Hs.193963 ESTs 



Hs.136806 
Hs.187750 
Hs.161023 
Hs.130729 
Hs.12Q980 
Hs.187748 



Hs.116122 
Hs.168690 
Hs.162459 
Hs.131965 



319403 
331469 
331549 
331644 
313220 



T98413 
N22273 
N58866 
T99544 
AI971981 



Hs.39140 
Hs.237507 
Hs.173734 
Hs.1 18241 



ESTs; Weakly similar to III! ALU SUBFAMI 
ESTs; Moderately similar to ill! ALU CIA 
EST 
ESTs 

nuclear receptor co-represser 2 

ESTs; Weakly simBarto llil ALU SUBFAMI 

EST cluster (not in UniGene) 

EST cluster (not in UniGene) 

ESTs; Weakly simflar to coded for by C. 

ESTs 

ESTs 

ESTs; Weakly simitar to llil ALU SUBFAMI 

CH22_EMAC005500.GENSCAN.56-5 

EST cluster {not tn UniGene) 

ESTs 

EST 

ESTs; Weakly similar to till ALU CLASS B 
ESTs 



Ratio Met/B surv 

19.65 

115 

9.31 

5.79 

4.79 

4.59 

4.58 

4.3 

4.26 

3.89 

3.81 

3.74 

3.66 

3.51 

3.21 

3.2 

3.15 

3.14 

3.14 

3.04 



133 



WO 02/068677 



PCT7US02/06001 



Table 8: I chip - B survivor vs Met query - down in Mets 



Pkey: Unique Eos probes et identifier number 

ExAccn: Exemplar Accession number, Genbank secession number 

UirigenelD: Unigene number 

Unigene Tide; Unigene gene title 



Pkey Exjten UniGJD 


Title 


Ratio Met/B surv 


333658 


CH22_FGENES241_4 


0.06 


333657 


CH22J : GENES241J 


0.07 


333654 


CH22_FGENES240J 


0.07 


33263 


CH22_FGENES27_2 


0.07 


333656 


CH22_FGENES240_4 


0.07 


304480 AA430373 


EST singleton (not In UniGene) with axon 


0.08 


333737 


CH22J=GENES261J 


0.09 


308601 AJ719930 


EST singleton (not in UniGene) with axon 


0.1 


334030 


CH22_FGENES.320_2 


0.1 


333637 


CH22_FGENES229J 


0.13 


302347 AFO3940O Hs.194658 chloride channel; calcium activated; fam 


0.16 


333653 


CH22_FGENES239_2 


0.16 


333635 


CH22.FGENES228J 


0.19 


333647 


CH22_FGENES235_2 


0.19 


307588 AI285535 


EST singleton (not In UniGene) with axon 


02 


337954 


CH2^_EMAC005500.GENSCAN.96^3 


02 


333588 


CH22_FGENES208_2 


0.21 


320244 AA296922 Hs.129778 gastrointestinal peptide 


0i2 


333642 


CH22_FGENES231_2 


023 


337951 


CH22_EMAC005500.GENSCAN.94-1 


023 


333730 


CH22_FGENES.258J 


0.23 


333646 


CH22_FGENES.234_2 


024 
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Table 9: H chip - B survivor vs Met query - up in Mets 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

\ JnigendD. Unigcnc number 

Uragene Title: Unigene gene title 



Pkey ExAccn UniGID Complete Jttte Median Mets AJ vs Median B-Sur A! 

100655 HG2841-HT2970 Albumin, Aft. Splice 5 11.98 

124875 R70506 Hs207693 ESTs; WeaHy similar to ALU SUBFAMI 9.21 

102193 U20758 Hs.313 secreted phosphoproteln 1 (osteopontin; 6.73 

100854 HG2841-HT2969 Albumin, AIL Splice 3, NEsspEcing In AJbalbumin Venezia 6.18 

118828 N79496 Hs.50824 EST 5.93 

128046 AA873285 Hs.137947 ESTs 5.9 

128896 014446 Hs.107 fibrinogen-Gke 1 5.17 

127917 AA211895 Hs.118831 EST; Highly similar to d J1 1 63J1 .2.1 [H.s 5.11 

125090 T91518 ye20fl)5.s1 Stratagene lung (#937210) Horn 4.47 

118579 N66905 small inducfele cytokine A5 (RANTES) 423 

123526 AA608657 ESTs;ModeralB)ysimBartoll]IALUSUB 421 

128062 AA379500 Hs.193155 ESTs 4.14 

119174 R71234 yr54c08^1 Scares placenta Nb2HP Homo sa 4.1 1 

128530 AA504343 Hs. 183475 Homo sapiens clone 25081 mRNA sequence 4.09 

119404 T92950 ye27c10.s1 Stratagene lung (#937210) Horn 3.98 

118475 N66845 Hs/165411 ESTs; WeaWy similar toll!! ALU CLASS B 3.96 

129974 K00629 Hs.199300 Human kpni repeat mma(cdna done pcd-k 3.87 

108888 AA135606 Hs.189384 ESTs; Weakly similar to IUI ALU SUBFAMI 3.85 

123963 C13961 Hs210115 EST 3.8 

123523 AA60B588 Hs.193634 ESTs 3.76 

128230 AA984074 Hs.176757 ESTs 3.75 

124090 H09570 Hs.143032 ESTs; Weakly similar to neuronal thread 3.67 

124690 R05818 Hs.173830 ESTs 3.58 

134261 AA227678 Hs.8084 Human DNA sequence from done 465N24 on 3.57 

126917 AA176225 Hs.193929 ESTs 3.52 

126050 H27267 Hs.75860 rn/droxyacyWoenzyme A aerrydrogenase/3-k 3.45 

126649 AA855990 Hs.125058 ESTs 3.42 

115(^6 AA255991 Hs.175319 ESTs 3.4 

129906 H39216 Hs.239970 ESTs; Weakly similar to ZNF91L [H^apien 338 

123022 AA480909 aa28f10.s1 NCLCGAPJ3CB1 Homo sapiens cD 338 

106145 AA424791 H&5734 WAA0679 protein 338 

125191 W67257 Hs.138871 ESTs; Weakly similar to fill ALU CLASS B 3.36 

108836 AA132061 H&222727 ESTs; Weakly similar to ubiquitous TPR m 33 

128710 J04813 Hs.104117 cytochrome P450; subfamily IIIA (niphedi 327 

123460 AA598981 Hs.251122 EST 325 

133735 AC002045 Hs251928 nuctear pore complex Interacting protein 324 

124696 R06273 Hs.186467 ESTs; Moderately simfiar to III! ALU SUB 324 

120748 AA303153 Hs237994 EST; Weakly similar to III! ALU SUBFAMIL 321 

133770 M69197 Hs.242279 haptoglobirHBlated protein 3.17 

128336 AI242720 Hs.146043 ESTs; WeaWy similar to alternatively sp 3.14 

135357 AA235803 Hs.79572 cathepsin D (lysosomal aspartyl protease 3.12 

128088 R02443 Hs.186467 ESTs; Moderately similar to I! II ALU SUB 3.08 

124055 F10904 Hs.100516 Homo sapiens done 23605 mRNA sequence 3.06 

124896 R62063 Hs.101594 EST 3.06 

127598 AA610677 Hs.168851 ESTs 3.04 

116802 H44061 Hs.1 94026 ESTs 3.01 
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Table 10: H chip - B survivor vs Met query - Down in Mets 



Pkey: Unique Eos probcset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UmgenelD: Unigene number 

Unigene Title: Unigene gene tide 



Pkey ExAccn UniGJD Comptetejntte Ratio Met/B surv. 

100116 DQ0654 Hs.77443 acfin; gamma % smooth muscle; enteric 0.09 

130064 T67053 Hs.181 125 lrrarttim)globuBn lambda gene cluster 0.11 

129982 M87789 Hs.140 immunogfobufin gamma 3 (Gm marker) 0.12 

131219 CC0476 Hs.24395 small Inducible cytokine subfamily B{Cy 0.13 

133806 M12759 Hs.76325 Human Ig J chain gene 0.17 

132962 L02326 Ha.198118 immunoglobulin tembdaJIkB porypeptide 2 0.18 

131713 X57809 Hs.181125 imnwnoglobuiin lambda gene cluster 0.18 

131791 S71043 Hs^2225 immunoglobulin alpha 1 02 

133725 V00563 Hs.179543 immunoglobulin mu 022 

101923 S75256 HNL=neuirophO lipocaDn [human, ovarian 023 

101461 M22430 Hs.76422 phosphoDpaseA2; group IIA (platelets: 024 

103448 XS9133 H&204238 lipocaltn 2 (oncogene 24p3) 024 
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Table 11: H chip - Met vs Normal query - up in Mets 



Pkcy: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigencID: Unigene number 

Um'geoe Title: Unigene gene title 

Pkey ExAccn UniGJD Complete.Title Median Mets Al vs Median Normal AJ 

00855 HG2841-HT2970 Albumin, Alt Splice 5 15.91 

02193 U20758 Hs.313 secreted phosphopratEin 1 (osteopontin; 6.83 

24875 R70506 Hs.207693 ESTs WeaklysimJIartollll ALU SUBFAMI 6.68 

00654 HG2841-H72969 Albumin, AIL Splice 3 r Msspficing In AltoaJbumin Venezia 5.28 

24059 F13673 Hs.99769 ESTs 5.11 

28896 D14446 Hs.107 fibrinogen-like 1 5.05 

34453 X70683 Hs.83484 SRY (sex determining region Y)-box 4 4.82 

31564 AA491465 Hs.28792 ESTs 4.78 

27917 AA211895 Hs.118831 EST; Highly similar to dJ1163J1 2.1 ^ 4.76 

15096 AA255991 Hs.175319 ESTs 4.67 

04558 R56678 Hs.88959 Human DNA sequence from done 9S7N21 on 4.63 

23526 AA608657 ESTs; Moderately similar to ALU SUB 4.61 

25090 T91518 ye2Qf05.s1 Stratagene lung (#937210) Horn 4.59 

I29666 M77349 Hs, 118787 transforming growth factor; beta-induced 4.58 

18828 N79496 Hs.50824 EST 4.56 

28046 AA873285 Hs.137947 ESTs 4.45 

33421 AA436560 Hs.7327 claudinl 4.09 

29158 J05257 Hs.109 dipeptidase 1 (renal) 4.04 

28062 AA379500 Hs.193155 ESTs 4.03 

I24696 R06273 Hs.186467 ESTs; Moderately similar to lill ALU SUB 4.01 

18475 N66845 Hs.165411 ESTs; Weakly similar to III! ALU CLASS B 3.96 

04755 AA024482 Hs.9029 DKFZP434G032 protein 3.83 

04978 AA088458 Hs.1 9322 ESTs 3.74 

18579 N68905 small fndudble cytokine A5 (RANTES) 3.7 

I23796 AA620390 Hs247444 ESTs 3.62 

27240 AA868387 Hs.243845 ESTs; Moderately similar to HI! ALU SUB 3.61 

04105 AA422123 Hs.42457 ESTs 3.55 

29349 D86974 Hs.1 10613 KIAA0220 protein 3.54 

19329 T51832 ESTs; Moderately simitar to !!!] ALU SUB 353 

14617 AA084146 Hs.110659 ESTs 3.52 

23143 AA487595 aa95e2*1 Stratagene fetal retina 93722 3.48 

03119 X63629 Hs.2877 cadherin 3; P-cadherin (pfecemai) 3.48 

19404 T92950 ye27c10.s1 Stratagene lung (#937210) Horn 3.47 

23963 C13961 Hs210115 EST 3.47 

16480 C14088 Hs.195188 glyceraldehyde^-Dhosphate dehydrogenase 3.4 

08836 AA132061 Hs222727 ESTs; Weakly similar to ubiquitous TPR m 3.39 

20748 AA303153 Hs237994 EST; Weakly similar to 111) ALU SUBFAMIL 3.38 

33770 M69197 Hs.242279 haptoglobin-related protein 3.38 

32358 X60486 Hs.46423 H4 histone family; member G 3.37 

27759 A1369384 arybufatase D 337 

I29095 L12350 Hs.108623 thrombospondln 2 3.37 

28261 AJ081213 Hs.13179 ESTs; Moderately similar to llil ALU SUB 336 

26906 AA169866 ESTs; Weakly similar to 11)1 ALU SUBFAMI 336 

28954 N3211B Hs209100 DKFZP434C171 protein 334 

19174 R71234 y!54c08.s1 Scares placenta Nb2HP Homo sa 333 

06687 AA463234 Hs.1 19387 KIAA0792 gene product 3.32 

28230 AA984074 Hs.176757 ESTs 33 

126649 AA856990 Hs.125058 ESTs 3.25 

24620 N74051 Hs.194092 ESTs; WeaWy similar to !!!! ALU SUBFAMI 3.24 

35427 AFFX control: human alu repeats 3.23 

29967 H99653 Hs.138618 ESTs 3.22 

25191 W67257 Hs.138871 ESTs; Weakly similar to lill ALU CLASS B 32 

124684 R02401 Hs221078 ESTs 3.2 

28010 AAB56953 Hs.23348 S-phase tase^ssoctated protein 2 (p45 3.17 

19423 T99544 Hs.173734 ESTs; Weakly similar to lill ALU CLASS B 3.16 

23022 AA480909 aa28f10.s1 NCI_CGAP_GCB1 Homo sapiens cO 3,15 

I03654 Z70759 Ksapiens mitochondrial 16S rRNA gene (p 3.13 

28336 AI242720 Hs.146043 ESTs; Weakly similar to alternatively sp 3.12 

24690 R05818 Hs.173830 ESTs 3.1 

29791 F02778 Hs.173887 WAA0876 protein 3.07 

14472 AA026924 Hs. 177407 ESTs; Weakly similar to III! ALU SUBFAMI 3.07 

15429 AA284139 Hs39295 EST 3.06 

30020 AA433930 Hs240443 ESTs; Weakly similar to HNK-1 suhotrans 3.06 

I26050 H27267 Hs.75860 hydroxyacyt-Coenzyme A dehydrogenase/3-k 3.05 
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1299% H39216 Hs.239970 ESTs; Weakly sfanOar to ZNF91L(H^apien 3.04 

123422 AAS98484 H&238476 EST 3.03 

103059 X57351 Hs.174195 interferon induced transmembrane protein 3.02 

1242S3 H69742 Hs.102201 ESTs 3.02 

123523 AA608588 Hs.193634 ESTs 3.02 

132669 AA188378 Hs.54602 ESTs; Weakly similar to 60S RIBOSOMAL PR3.02 

123196 AA489250 H&59403 serine palmitoyttfansferase; subunit II 3.01 

122948 AA477483 zu44h2.s1 Scares ovary tumor NbHOT Homo 3.01 

119053 R11501 yQBflJl Scares fetal liver spleen 1 NFL 3.01 

125953 H40829 yo05d11.fi Soares adult brain N2o5HB55Y 3 

119155 R61715 Hs.138237 ESTs 3 
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Table 12: H chip - Met vs Normal query - down in Mets 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank secession number 

UnigencIDi Unigene number * 

Unigene Title: Unigene gene title 



Pkey ExAccn UniGJD Complete Jttle Median Mets Al vs Median Normal Al 

103466 Y00339 Hs.155097 carbonic anhydrase II 0.01 

104258 AF007216 Hs.5462 solute carrier family 4; sodium bicarbon 0.02 

108999 AA156054 Hs.72115 ESTs 0.04 

101046 K01 160 Accession not fisted in Genbank 0.04 

133565 H57056 Hs^04831 ESTs 0.05 

101346 L76465 Hs.77348 hydroxyprostaglandin dehydrogenase 15-(N 0.05 

123137 AA487468 Hs. 100686 ESTs; Weakly simflar to secreted cement 0.05 

134534 X73501 Hs.84905 H. Sapiens mRNA forcytokeratin 20 0.05 

118823 N79237 Hs.50813 ESTs; Weakly similar to long chain fatty 0.06 

102095 U11313 Hs.75760 sterol carrier protein 2 0.06 

111855 R37362 Hs.21351 ESTs 0.06 

129105 AA224351 Hs.108681 ESTs 0,07 

130320 U19495 Hs.237356 stromal cell-derived factor 1 0.07 

113778 W15263 Hs.5422 ESTs 0.07 

116786 H25836 Hs.83429 tumor necrosis factor (Ggand) superfami 0.07 

100116 D00654 Hs.77443 adin; gamma 2; smooth muscle; enteric 0.07 

104636 AA004415 Hs.106106 ESTs 0.07 

107032 AA599472 Hs.247309 sucdnate-CoA figase; GDP-forming; beta 0.08 

106605 AA457718 Hs.21103 Homo sapiens mRNA; cDNA DKFZp564B076 (fr 0.08 

128908 AA487557 Hs.10706 ESTs 0.08 

130016 AA055811 Hs. 143131 trarismembrane glycoprotein 0.08 

113523 T90037 Hs.16686 ESTs 0.08 

102638 U67319 Hs.9216 . caspase 7; apoptosis-reiated cysteine pr 0.09 

124308 H93575 Hs^27146 Homosapterrsmf^;c^NADKFZp564J142(fr 0.09 

129519 AA298786 Hs.112242 ESTs 0.09 

134749 L10955 Hs.89485 carbonic anhydrase IV 0.09 

130366 L1170B Hs.155109 hydroxysteroid (17-beta) dehydrogenase 2 0.09 

109272 AA195718 Hs.86030 ESTs 0.09 

102124 U14528 Hs.29981 solute carrier family 26 (sulfate transp 0.1 

132711 N73702 Hs.238927 ESTs 0.1 

131861 D11925 Hs.184245 WAAQ929 protein Msx2 Interacting nudea 0.1 

133806 M12759 Hs.76325 Human Ig J chain gene 0.1 

102571 U60115 Homo sapiens skeletal musde UM-protein 0.1 

114846 AA234929 Hs.44343 ESTs 0.11 

131328 V01512 Hs.25647 v-fos FBJ murine osteosarcoma viral onco 0.11 

106569 AA455983 Hs.117816 sorcm 0.11 

103542 Z11793 Hs.3314 selenoprotein P; plasma; 1 0.11 

128915 C02386 Hs.107139 ESTs 0.11 ' 

120914 AA377254 Hs.97107 EST 0.11 

130867 J04093 Hs^056 UDP grycosyltransferase 1 0.11 

110837 N30796 Hs.17424 ESTs; WeaMy sirnSar to semaphorin F [H. 0.12 

101877 M97496 Hs.778 guajiyiate cyclase activator 1B (retina) 0.12 

132617 AA171913 Hs.5336 carbonic anhydrase XII 0.12 

129113 AA147646 Hs.108740 DKFZP586A0522 protein . 0.12 

133435 T23983 Hs.7365 ESTs 0.13 

132836 F09557 Hs.57929 slit (DrosophBa) homotag 3 0.13 

125832 AA628600 Hs.117587 ESTs 0.13 

104613 AA001049 Hs.24713 Homo sapiens mRNA; cDNA DKFZp586G0123 (f 0.13 

132903 AA235404 Hs.5985 Homo sapiens clone 25186 mRNA sequence 0.13 

119479 W32094 Hs.55501 ESTs 0.14 

131273 AA421139 Hs.173542 ESTs 0.14 

106674 AA461303 Hs.7946 DKFZP586D1519 protein 0.14 

108980 AA151676 Hs.33455 peptidy! arginine deimhase; type II 0.14 

103211 X73079 Hs.205126 polymeric Immunoglobulin receptor 0.14 

131219 C00476 Hs.24395 small inducible cytokine subfamily B (Cy 0.15 

116459 AA621399 Hs.64193 ESTs 0.15 

130219 R77539 Hs.15285 ESTs 0.15 

113863 W68388 Hs.21288 ESTs; Weakly similar to WAA0704 protein 0.15 

101564 M32886 Hs.117816 sorcln 0.15 

109^)2 AA233837 Hs.44755 ESTs; Weakly similar to membrane glycopr 0.15 

107222 D51235 Hs.82689 tumor rejection antigen (gp96) 1 0.15 

135237 AA454930 Hs.9691 ESTs 0.15 
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112483 R66534 Hs.28403 ESTs 0.15 
132387 R70914 Hs.8997 heat shock 70kD protein 1 0.15 
130343 AA490262 Hs.15485 ESTs; Weakly similar to APICAL-UKE PROT 0.16 
105498 AA256323 Hs.25264 DKFZP434N126 protein 0.16 
104037 AA372630 Hs.100347 drfterentraDy expressed in hematopoteti 0.16 
101461 M22430 Hs.76422 phosphollpaseA2; group IIA (platelets; 0.16 

116551 D20458 Hs.229071 EST 0.16 

133889 AA099391 Hs.211582 myosin; i^ht polypeptide kinase 0.16 

103653 270295 Hs.32966 guanylate cyclase activator 2B(uroguany 0.16 

101070 L02785 Hs.1650 down-regulated In adenoma 0.17 

131501 AA121127 Hs.181307 H3histone; family 3A 0.17 

133515 X98311 Hs.74466 cercmoembryonlcantigerwBlatedcslIad 0.17 

108604 AA0S9820 Hs.49696 ESTs 0.17 

132982 L02326 Hs. 198116 Immunoglobulin lambaa-Bke polypeptide 2 0.17 

131676 C20785 Hs.30514 ESTs 0.17 

134675 AA250745 Hs.87773 protein kinase; cAMP-dependent catalyti 0.17 

133441 M82962 Hs.179704 meprin A; alpha (PABA peptide hydrolase} 0.18 

130455 X17059 Hs.155956 ffacetyltransferase 1 (arytamine N-acety 0.18 

131734 D62965 Hs.31297 ESTs 0.18 

100749 HG3521-HT3715 Ras-Retated Protein Raplb 0.18 

116724 F13665 Hs.65641 ESTs 0.18 

129265 X68277 Hs. 171 695 dual specificity phosphatase 1 0.18 

102347 U37518 Hs.83429 tumor necrosis factor (ligand) superfami 0.18 

114542 AA055768 Hs.122576 ESTs 0.18 

123900 AA621223 Hs.112953 EST 0.19 

121780 AA422086 Hs.124660 ESTs 0.19 

115662 AA405715 Hs.64179 hypothetical protein 0.19 

113803 W42789 Hs^1446 ESTs 0.19 

105493 AA256268 Hs.10283 ESTs 0.19 

113195 T57112 yc20g11.s1 Straiagene lung (#837210) Horn 0.19 

129462 D84239 Hs.111732 IgG Fc btrufing protein 0.19 

133664 X86693 Hs.75445 hevin 02 

126180 R18070 Hs.3712 ubiqumoJ-cytochrome c reductase; Rieske 0.2 

100687 H63115-HT3291 GoShMbp (Gb:L18862) 02 

130064 T67053 Hs.181125 ImmunogbbuBn lambda gene duster 0.2 

101367 M12963 Hs.73843 alcohol dehydrogenase 1 (class I); alpha 0.2 

132254 L20826 Hs.430 plastln 1 (I boform) 02 

105646 AA282147 Hs.5888 ESTs 0.2 
132883 AA047151 Hs.5897 Homo sapiens mRNA; cONA DKFZp586P1 622 (f 

132618 AA253330 Hs.5344 adaptor-related protein complex 1 ; gamma 0.21 

108931 AA147186 Hs250746 ESTs 022 

131421 X64177 Hs.2867 metalbthlonein 1H 022 

107295 T34527 Hs.80120 UDP^^tyl^pra^atactosamlne:polyp 0.22 

103576 Z26317 Hs.2631 desmoglein2 0.22 

105173 AA182030 Hs.8364 ESTs 022 

134843 H60595 Hs.90061 progesterone binding protein 022 

102009 U026B0 Hs.82643 protein tyrosine kinase 9 023 

123997 D51171 Hs.78902 voltage-dependent anion channel 2 023 

106609 AA458652 H&32181 ESTs 023 

101300 L40391 Hs.6445 Homo sapiens (clone s153) mRNA fragment 023 

129717 AA481670 Hs.12150 ESTs; Weakly similar to retinal shorten 023 

108565 AA085342 Hs.1526 ATPase; Ca++ transporting; cardiac muscl 023 

121314 AA402799 Hs.182538 ESTs 023 

124803 R45480 Hs.164866 cycflnK 023 

130208 AA620556 Hs.15250 peroxisomal D3;D2-enoy^CoA isomerase 0.23 

132888 AA490775 Hs.5920 UDP^cetylglucosamirre-2-epirnerase^-ac 0.23 

132720 Z69881 Hs.5541 ATPase; Ca++ transporting; ubiquitous 0.23 

102239 U26726 Hs.1376 hydroxysteroid (11-b(te) dehydrogenase 2 023 

115764 AA421562 Hs.91011 anterior gradient 2 (Xenepus laevts) horn 024 

130558 H96654 Hs.15984 ESTs; Weakly similar to gene pp21 protei 024 

.122666 AA455052 Hs.99387 ESTs 024 

134495 063477 Hs.84087 KIAA01 43 protein 024 

124017 F02202 Hs.100960 ESTs 024 

106925 AM91261 Hs.37558 Homo sapiens clone 23923 mRNA sequence 024 

115187 AA261805 Hs.44021 ESTs 0.24 

105309 AA233790 Hs.4104 ESTs;WeaJdysim^tocDNAESTyk386g7 0.24 

124457 N50114 Hs.128704 ESTs 0.24 
130616 AA233763 Hs.16726 Homo sapiens mRNA; cDNA DKFZp564A132 (fr 

105795 AA369245 Hs,17448 ESTs; Weakly similar to Ifl! ALU SUBFAMI 025 

134579 N23222 Hs.85963 CD36 antigen (collagen type I receptor; 025 
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Table 13: H chip - Met vs Normal query - up in Mets 



Pkey; Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Oenbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 



Pkey ExAccn UniGJD Comptetejraie Ratio Met/Normal 

102193 U20758 Hs.313 secreted phosphopnotein 1 (osteopontin; 8.457 

111307 N73988 Hs.37477 ESTs; Weakly similar to CGI-141 protein 6.05 

103119 X63629 H&2677 cadhertn3; P-cadheiin (placental) 5.207 

131564 AA491465 Hs.28792 ESTs 5.136 

119729 W69747 Hs.94806 WAA1062 protein 4.667 

124059 F13673 Hs.99769 ESTs 4.398 

123967 C21171 Hs.95497 ESTs; Weaidy similar to GLUCOSE TRANSPOR 4.292 

128817 N47524 Hs.28491 spermldlrte/spermhe N1-acety (transferase 3.964 

133770 M69197 H&242279 haptoglobirHelated protein 3.623 

130412 AA406554 H&241572 gotgi autoantigen; golgln subfamily a; 5 3.719 

104755 AA024482 Hs.9029 DKFZP434G032 protein 3.702 

132676 AA283035 Hs.54813 ESTs 3.645 

134453 X70683 Ks.63484 SRY (sex determining regran Y)-box 4 3.561 

124690 R05818 Hs.173830 ESTs 3.446 

106949 AA496805 Hs.177425 WAAQ964 protein 3.42 

130724 AA370091 Hs.179680 ESTs 3.402 

128992 R49693 Hs.107708 ESTs 3.32 

133421 AA436560 Hs.7327 daudkil 3255 

103047 X55990 Hs.73839 rfoonuclease; RNase A family; 3 (eoslnop 3229 

102990 X51441 Hs.181062 serum amyloid A1 3.149 

115429 AA284139 Hs.89295 EST 3.114 

129158 J05257 Hs.109 dipep&iase 1 (renal) 3.019 

123533 AA608751 H&244904 ESTs; Weakry simitar to 111) ALU SUBFAMl 3.011 
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Table 14: H chip - Met vs Normal query - down in Mets 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Uni gene number 

Unigene Title: Unigene gene title 



PksyY ExAccn UntGJD CompIete.TrUB Ratio Met/Normal 

103466 Y00339 Hs.155097 carbonic anhydrase II 0.012 

104258 AF007216 Hs.5462 solute carrier famify 4; sodium bicarbon 0.025 

108999 AA156084 Hs.72115 ESTs 0.034 

101046 K01 160 Accession not feted m Genbank 0.041 

133565 H57056 Hs.204631 ESTs 0.042 

101346 L76465 Hs.77348 hydroxyprodtsgtandin dehydrogenase 15^N 0.043 

102095 U11313 Hs.75760 sterol carrier protein 2 0.054 

111655 R37362 Hs.21351 ESTs 0.055 

130320 U19495 Hs.237356 stromaJ cetklerived factor 1 0.056 

123137 AA487468 Hs.100686 ESTs; Weakly similar to secreted cement 0.06 

107222 D51235 Hs.82669 tumorrejecttonan^en(gp96) 1 0.06 

102638 U67319 Hs.9216 caspase 7; apoptosis-related cysteine pr 0.063 

128906 AA487557 Hs.10706 ESTs 0.065 

129105 AA224351 Hs.108681 ESTs 0.069 

110837 N30796 Hs.17424 ESTs;Wealdysini^toseirtapnorinF[H. 0.069 

100116 D00654 Hs.77443 actin; gamma 2; smooth muscle; enteric 0.071 

116786 H25836 Hs.83429 tumor necrosis factor (ligand) superfami 0.074 

130867 J04093 Hs.2056 UDP glycosyftraraferase 1 0.075 

132836 F09557 Hs.57929 slit (Drosophila) homobg 3 0.076 

131861 D1 1925 Hs.184245 K1AA0929 protein Msx2 interacting nudea 0.081 

106574 AA461303 Hs.7946 DKF2P5B6D1 519 protein " 0.084 

109272 AA195718 Hs.86030 ESTs 0.088 

132711 N73702 Hs.238927 ESTs 0.091 

106569 AA455983 Hs.117816 sorcin 0.092 

104636 AA004415 Hs.106106 ESTs 0.093 

118823 N79237 Hs.50813 ESTs; Weakly slmter to tong chain fatty 0.094 

134534 X73501 Hs.84905 H. Sapiens mRNA for cytokeratin 20 0.095 

119479 W32094 Hs55501 ESTs 0.096 

113776 W15263 Hs.5422 ESTs 0.098 

128482 U83908 Hs, 100407 programmed ceD death 4 0.102 

124653 N92664 Hs.109641 ESTs 0.106 

133407 AA093348 Hs.7306 secreted frizzJetkelated protein 1 0.108 

135237 AA454930 Hs.9691 ESTs 0.109 

116250 AA480975 Hs.44829 ESTs 0.111 

132617 AA171913 H&5338 carbonic anhydrase XII 0.112 

131273 AA421139 Hs.173542 ESTs 0.113 

116710 F10577 Hs.70312 ESTs 0.114 

131791 S71043 Ks.32225 Bmmunoglobufin alpha 1 0.114 

112483 R66534 Hs.28403 ESTs 0.115 

132017 W67251 H&37331 Homo sapiens vav 3 oncogene (VAV3) mRNA0.116 

124308 H93575 Hs^27146 Homo sapiens mRNA; cDNA DKFZ0564J142 (fr 0.117 

114846 AA234929 Hs.44343 ESTs 0.119 

116551 D20458 Hs.229071 EST 0.12 

105299 AA233511 Hs.194720 ATP-oinding cassette; sub-family G (WHIT 0.122 

130366 L11708 Hs.155109 hydroxysterrid (17-beta) dehydrogenase 2 0.122 

133806 M12759 Hs.76325 Human ig J chain gene 0.122 

104776 AA026349 Hs.31412 ESTs 0.125 

129565 X77777 Hs.198726 vasoactive intestinal peptide receptor 1 0.125 

131272 AA423684 Hs. 139 033 patemaBy expressed gens 3 0.127 

105774 AA348014 Hs.23412 ESTs 0.128 

134604 M22995 Hs.865 RAP1A; member of RAS oncogene family 0.128 

134711 XD4011 Hs.88974 cytochrome b-245; bela polypeptide (chro 0.128 

129113 AA147646 Hs.108740 DKFZP586A0522 protein 0.133 

123995 D51119 Hs.100090 tetraspan3 0.133 

129168 T90621 Hs.1 09052 chromosome 14 open reading frame 2 0.133 

123891 AA621103 Hs.99216 ESTs; Moderately similar to ALU SUB 0.135 

132634 M60830 Hs.5509 ecotropic viral integration site 2B 0.135 

135342 W60097 Hs.99120 DEAD/H (Asp-Glu-Ara-Asp/His) box porypep 0.135 

131510 AA207114 Hs.27842 ESTs; Weakly similar to simlar to 1 -acy 0.137 

133652 AA287383 Hs.7540 ESTs 0.137 

134749 L10955 Hs.89465 carbonic anhydrase IV 0.139 

106586 AA456598 Hs.256269 ESTs 0.139 
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108B93 AA489636 Hs.25253 ESTs 0.139 

101070 L02785 Hs.1650 dowrwegutated in adenoma 0.14 

114293 Z40718 Hs20196 adenylate cyclase 9 0.14 

113966 WB6600 Hs.9842 ESTs 0.141 

101185 L19372 Hs.170087 aiyl hydrocarbon receptor 0.145 

131492 AA393876 Hs.1255 nuclear receptor subfamily 2; group F;m 0.145 

133889 AA099391 Hs211582 myosin; tight polypeptide kinase 0.145 

120914 AA377254 Hs.97107 EST 0.147 

118771 N74690 Hs.50547 ESTs 0.149 

105496 AA256323 Hs.25264 DKFZP434N126 protefo 0.151 

131011 R41771 Hs.22148 ESTs 0.153 

106210 AA428239 Hs.10338 ESTs 0.154 

114069 Z38161 Hs.1 97335 plasma glutamate carboxypeptldase 0.154 

133011 AAD42990 Hs.171921 sema domain; Immunoglobufin domain (lg); 0.154 

115987 AA446887 Hs.42911 ESTs 0.154 

102571 U60115 Homo sapiens skeletal musde UNVorotein 0.155 

100687 HG3115-HT3291 GoI!i-Mbp(Gb:L18862) 0.155 

132903 AA235404 Hs£985 Homo sapiens done 25186 mRNA sequence 0.155 

125832 AA628600 Hs.117587 ESTs 0.155 

130064 T67053 Hs.181125 immunogbbuttn lambda gene cluster 0.157 

123264 AA4910Q3 H&59824 BCE-1 protein 0.159 

130919 AA291710 Hs21Z76 collagen; type IV; alpha 3 (Goodpasture 0.159 

103542 Z11793 Hs.3314 selenoprotein P; plasma; 1 0.161 

101478 M23379 Hs.758 RAS p21 protein activator (GTPase active 0.162 

108921 AA142913 Hs.71721 ESTs 0.164 

100642 HGZ743-HT3926 C^Wesmonl.AILSpItoe.NorvMjscle 0.167 

132109 AA599801 Hs.40098 ESTs 0.167 

115719 AM16997 Hs.59622 ESTs 0.169 

128915 C02386 Hs.107139 ESTs 0.171 

117634 N36421 Hs.107854 ESTs; Weakly similar to SODIUM- AND CHLO 0.172 

129462 D84239 Hs.1 11732 IgG Fc binding protein 0.174 

131328 V01512 Hs25647 v-fos FBJ murine osteosarcoma viral onco 0.176 

130343 AA490262 Hs.15485 ESTs; WeaWy similar to APICAL-UKE PROT 0.177 

115764 AA421562 Hs.91011 anterior gradient 2 (Xenepus laevis) hom 0.177 

122261 AA436830 Hs.98902 ESTs 0.179 

106605 AA457718 Hs21103 Homo sapiens mRNA; cDNA DKFZp564B076 (fr 0.179 

109991 H09813 Hs.12896 KIAA1 034 protein 0.181 

101300 L40391 Hs.6445 Homo sapiens (clone s153) mRNA fragment 0.181 

123080 AA485303 Hs.205126 polymeric immunoglobulin receptor 0.182 

130016 AA055811 Hs.143131 transmembrane glycoprotein 0.186 

122668 AA455052 Hs.99387 ESTs 0.188 

105453 AA252893 Hs£001 ESTs 0.189 

108980 AA151676 Hs.33455 peptidyl arginine deiminase; type II 0.19 

100248 D31888 Hs.78398 K1AA0071 protein 0.192 

130038 AA195260 Hs20S738 EST s; Moderately simitar to III) ALU SUB 0.192 

110682 N36001 Hs.17348 ESTs; Weakly similar to Ml ALU SUBFAMI 0.193 

131676 C20785 Hs.30514 ESTs 0.195 

111029 N54792 Hs.24697 cytidine mOToriiCBp^ate^cetylneuramlni 0.196 

131257 AA256042 Hs.24908 ESTs 0.196 

133348 T235J7 Hs.7149 ESTs 0.198 

133784 AA214305 Hs.76173 ESTs 0.196 

113863 W68388 Hs212B8 ESTs; Weakly similar to K1AA0704 protein 0.197 

103158 X67235 Hs.118651 hematopofeticaliy expressed homeobox 0.198 

102347 U37518 Hs.83429 tumor necrosis factor (iigand) superfami 02 

111351 N90223 Hs.23392 ESTs 0.2 

123495 AA599850 Hs.106747 ESTs; WeaWy simBar to similar to BPTV 02 

123802 AA620448 Hs.61408 Homo sapiens done 24760 mRNA sequence 0.2 

129243 H88033 Hs.109727 KJAA0733 protein 0.2 

130219 R77539 Hs.15285 ESTs 0.2 

131171 H04644 Hs.167619 ESTs; WeaWy similar to II!! ALU SUBFAMI 0.2 

133746 U44378 Hs.75862 MAD (mothers against decapentapiegic; Dr 02 

116459 AA621399 Hs.64193 ESTs 0201 

109613 F03031 Hs27519 ESTs 0202 

133435 T23983 Hs.7365 ESTs 0202 

103002 X52001 Hs.1408 endotheEn3 0.204 

125153 W38294 Accession not listed in Genbank 0204 

131919 AA121266 rte^4641 ESTs 0.204 

100749 HG3521-HT3715 Ras-Related Protein Raplb 0205 

105085 AA147537 Hs.4811 ESTs 0208 

124571 N67470 Hs.173074 DKF2P56401863 protein 021 

129519 AA298786 Hs.112242 ESTs 021 

116724 F13665 Hs.65641 ESTs 021 

132932 T15482 Hs.6093 ESTs 021 

113803 W42789 Hs.31446 ESTs 0211 

110792 N24899 Hs.6630 ESTs 0212 

105178 AA167490 Hs21941 ESTs 0.212 
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107295 T34527 HsJ012Q UDP^cetifl^ha^alactosaminerpolyp 0212 



115262 AA279112 Hs.88594 ESTs 0213 

115839 AA429038 Hs.40541 ESTs 0213 

103211 X73079 Hs.205126 polymeric immunoglobulin rec^tor 0214 

108604 AA099a20 Hs.49696 ESTs 0215 

105173 AA182030 Hs.8364 ESTs 0217 

108539 AA084677 Hs.54558 ESTs; Weakly similar to protein B [H.sap 0217 

109984 H09594 Hs.10299 ESTs 0217 

133536 Y00264 Hs.177486 amyloid beta (A4) precursor protein (pro 0217 

129965 T71333 Hs.13854 ESTs 0219 

114542 AA05576B Hs.122576 ESTs 0219 

132982 L02326 Hs.198118 Immunoglobulin larnbda-fike polypeptide 2 022 

101809 MB6849 Homo sapiens connexrn 26 (GJB2) mRNA, oo 0222 

105795 AA369245 Hs.17448 ESTs; Weakly similar to lilt ALU SUBFAMl 0.222 

132119 H99211 Hs.40334 ESTs 0222 

132733 R25385 Hs.123654 KIAA0824 protein 0222 

109415 AA227219 Hs, 11 0826 trinucleotide repeat containing 9 0222 

113083 T40530 Hs*241 ESTs; Weakly similar to heat shock prote 0223 

107053 AA600147 Hs.5741 ESTs; Weakly similar to NADH-cytochrome 0224 

103653 Z70295 Hs.32966 guanyiate cyclase activator 2B(uroguany 0225 



104613 AA001049 Hs24713 Homo sapiens mRNA; cDNA DKFZp586G0123 (f 0225 



126180 R18070 Hs.3712 ubtquinok^chrorne c reductase; Rieske 0.227 

132015 D11900 H&3731 ESTs 0.227 

130616 AA233763 Hs.16726 Homo sapiens mRNA; cDNA DKFZp564A132 (tr 02Z7 

132883 AA047151 Hs.5897 Homo sapiens mRNA; cDNA DKFZp586P1622 (f 023 

123169 AA488892 Hs.104472 ESTs; WeaWy similar to Gag-Pol polyprot 0233 

115187 AA261805 H&44021 ESTs 0.234 

116787 H28581 Hs.15641 ESTs 0234 

113195 T57112 yc20g11.s1 Stratagenelung (#937210) Horn 0235 

130707 W45457 Hs203559 ESTs 0235 

124803 R45480 Hs.164866 cydinK 0235 

116844 H64938 Hs.38331 ESTs 0235 

102759 U81607 Hs.768 A kinase (PRKA) anchor protein (gravin) 0238 

130584 AA009839 Hs.180841 tumor necrosis factor receptor superfam! 0.238 

133240 D31161 Hs.68613 ESTs 0238 

132952 AA425154 Hs.61426 ESTs 0239 

132720 Z69881 Hs.5541 ATPase; Ca^trartsporfing; ubiquitous 024 

131734 D62965 Hs31297 ESTs 024 

111890 R38678 Hs.12365 ESTs 0241 

102325 U35139 Hs.50130 necdin (mouse) homotog 0244 

104968 AA084602 Hs.29669 ESTs 0244 

105674 AA284755 Hs214742 CDW52 antigen (CAMPATH-1 antigen} 0244 

120519 AA258585 Hs. 129887 cadherin 19 (NOTE redefinition of symbo 0244 

134675 AA250745 Hs.87773 protein kinase; cAMP-dependent; catalyti 0244 

130642 M63438 Hs.1561 10 ImmunogtobuBn kappa variable 1W .0245 

134418 R78190 Hs.82933 ESTs; Weakly similar to cDNA EST EMBLT0 0245 

115137 AA257976 Hs.56155 ESTs 0245 

131713 X57809 Hs.181125 immunoglobulin lambda gene cluster 0246 

108931 AA147186 HS250746 ESTs 0246 

106609 AA458652 Hs.32181 ESTs 0248 

115559 AA393810 Hs.41067 ESTs 025 

133985 L34657 Hs.78146 ptateletfendothelial cell adhesion molec 025 

134088 D43636 Hs.79025 KIAA0096 protein 0.25 

134487 R38185 Hs.83954 Homo sapiens unknown mRNA 025 
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Table 15: I chip - Met vs Normal query - up in Mets 



Pkey: Unique Eos probcsct identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene tide 



Pkey Ex_Accn UniGJD Title Ratio Met/Normal 

319379 T91443 Hs.193963 ESTs 18.71 

321920 N63915 EST cluster (not in UniGene) 11.9 

314522 AI732331 Hs.187750 ESTs; Moderately simOar to Oil ALU CLA 723 

315720 AW291875 Hs.163900 ESTs 6.06 

308010 AI439190 Hs.181165 eukaryotic translation elongation factor 5.76 

313774 AW136836 Hs.144583 ESTs 5.01 

300734 AW205197 Hs240951 ESTs 3.98 

337895 CH22_EMAC005500.GENSCAN.56-2 3.98 

312339 AA524394 EST cluster (not in UniGene) 3.66 

331644 T99544 Hs.173734 ESTs; Weakly similar to 111! ALU CLASS B 3.53 

324643 AI436356 Hs.130729 ESTs 3.52 

324302 AA543008 Hs.136806 ESTs; Weakly similar to !H! ALU SUBFAMI 3.41 

314912 A1431345 Hs.161784 ESTs 333 

319403 T98413 EST cluster (not in UniGene) 332 

308676 AI761036 EST singleton (not In UniGene) with exon 327 

331858 AA421163 Hs. 163848 ESTs 322 

315178 AW362945 Hs.162459 ESTs 321 

321354 AA078493 EST duster (not in UniGene) 3.18 

337898 CH2^_EMAC005500.GENSCANi^5 3.16 

322682 AM 10679 EST cluster (not In UniGene) 3.15 

313197 AI738851 Hs222487 ESTs 3.1 

308991 A1879831 EST singleton (not in UniGene) with exon 3.08 

310016 AW449612 Hs.152475 ESTs 3.05 
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Table 16: I chip - Met vs Normal query - down in Mets 



Play: Unique Eos probcsct identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UmgenelD: Unigene number 

Unigcne Title: Unigene gene title 



Pkey EiLAccn UniGJD title Ratio Met/Normal 

303041 AF127035 EST duster (not in UniGene) with exon h 0.02 
302360 AJ010901 Hs.198267 mucin 4; tracheobronchial 0.03 
301948 AA344647 Hs.116724 aldo^eto reductase family 1;m8mberB11 0.03 

336091 CH22_FGENES.689_3 0.04 
333657 CH22J : GENES.241_2 0.04 
333858 CH22_FGENES54U 0.04 
333737 CH22_FGENES.261J 0.05 
333655 CH22.FGENES.240..4 0.05 
302347 AF039400 Hs.194659 chbride channel; calcium activated; fam 0.06 
336084 CH22.FGENES.688 13 0.06 
330385 AA449749 Hs.31386 ESTs; Highly similar to secreted apoptos 0.06 
304487 AA434241 EST singleton (not in UniGene) with exon 0.07 
302292 AF067797 EST cluster (not in UniGene) with exon h 0.07 
334030 CH22J=GENES.320_2 0.07 
332859 CH22_FGENES.27_2 0.07 
333654 CH22_FGENES.240_2 0.07 
303270 AL120518 Hs. 105352 ESTs 0.08 
320352 Y13323 Hs.145296 dismtegrin protease 0.08 
333637 CH22_FGENES^29J 0.08 
324094 AA382603 EST cluster (not in UniGene) 0.08 
320590 U67058 Hs.168102 Human proteinase activated receptor^ rnR 0.08 
330622 X63597 Hs.2996 sucrase-isomaltase 0.08 
331441 H75860 Hs.39720 ESTs 0.08 
308601 AI719930 EST singleton (not in UniGene) with exon 0.09 
323770 AA722425 EST duster (not in UniGene) 0.09 
335188 CH22_FGENES.507_3 0.09 
333730 CH22LFGBJES258J 0.09 
304480 AA430373 EST singleton (not in UniGene) with exon 0.09 
336081 CH23.FGENES.688J0 0.1 
332071 AA598594 Hs.112475 ESTs - 0.1 
318538 N28625 Ks.74034 caveolin 1; caveofae protein; 22kD 0.1 
311331 AI679622 H&32225 immunoglobulin alpha 1 0.1 
319668 NMJJ02731 EST cluster (not In UniGene) 0.11 
332567 N23730 Hs35647 v-fos FBJ murine osteosarcoma viral onco 0.11 
319395 AW062570 Hs.13809 ESTs 0.11 
315594 AI983437 Hs.155145 ESTs 0.11 
321539 N98619 Hs.62461 ARP2 (adirwelated protein 2; yeast) ho 0.12 
333647 CH22.FGENES.235J 0.12 
333588 CH22LFGENES.206^2 0.12 
321286 AI380940 EST duster (not in UniGene) 0.12 
320727 U96044 EST duster (not in UniGene) 0.13 
335687 CH2^.FGENES^98J 0.13 
324611 AA743462 Hs.165337 ESTs 0.14 

335115 CH22_FGENES.496J 0.14 
324660 AA541644 Hs.186044 ESTs 0.14 
337951 CH22.EM^CO055OO.GENSCAN.94-1 0.14 

302332 AI833168 Hs.184507 Homo sapiens Chromosome 1 6 BAC done CIT 0.14 

300921 AW293224 Hs.232165 ESTs 0.14 

333646 CH2LFGENES.234J 0.14 

335116 CH2^FGENES.496J 0.14 
320211 AL039402 Hs.125783 DEME-6 protein 0.15 

336092 CH22_FGENES.689_6 0.15 
330673 D57823 Hs.92962 Sec23(S.cerevisiae)homobgA 0.16 

303042 AF129532 EST duster (not in UniGene) with exon h 0.16 
337954 CH22_EM^C005500.GENSCAN.9W 0.16 
336645 CH22.FGENES.26-1 0.16 
335651 CH22LFGENES590J 0.16 
314499 AL044570 Hs.147975 ESTs 0.17 
338124 CH22LFGENES.701J 0.17 
315199 AA877996 Hs.125376 ESTs 0.17 
324525 AW044647 Hs.196284 ESTs 0.17 
320825 NM.004751 EST duster (not in UniGene) 0.18 
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302049 


AA377072 Hs.129792 


Homo sapiens Chromosome 16 BAC clone CIT 


336083 






CH22.FGENES.688J2 


0.18 


333653 






CH22.FGENES239J 


0.18 


323243 


W44372 




EST cluster (not in UntGene) 


0.19 


316610 


AWOB7073 Hs.126731 


ESTs 


0.19 


315033 


AI493046 


Hs.146133 


ESTs 


0.19 


330551 


U39840 


Hs.105440 


hepatocyte nuclear factor 3; alpha 


0.19 


333642 






CH22.FGENES231J 


0.19 


301261 


AA843986 


Hs.190586 


ESTs 


02 


333626 






CH22.FGENES224J2 


0.21 


303792 


C75094 


Hs.199839 


ESTs; Highly similar to NG22 [Haapiens] 


0.21 


332325 


T79428 


Hs.191264 


ESTs 


0.21 


321223 


AA431366 




EST cluster (not in UntGene) 


021 


333635 






CH22.FGENE&228J 


022 


314645 


A1808999 


Hs.207570 


ESTs 


022 


322929 


AI365585 


Hs.146246 


ESTs 


022 


324718 


AI557019 


Hs.116467 


ESTs 


0.22 


335652 






CH22_FGENES.590_3 


022 


307783 


AI347274 




EST singieton (not in UntGene) with exon 


022 


331344 


AA357927 


Hs.70208 


ESTs 


022 


336086 






CH2^FGENES.688J7 


023 


320802 


D83824 


Hs.185055 


BENE protein 


0.23 


335692 






CH22JGBESJS96J 


0.23 


333593 






CH22.FGENES210J 


023 


335667 






CH2^FGENESi90_18 


024 


314653 


AA729232 


Hs.153279 


ESTs 


024 


320244 


AA296922 


Hs.129778 


gastrointestinal peptide 


0.24 


300601 


A1762130 


Hs.165619 


ESTs 


024 


305080 


AA641485 




EST singleton (not m UniGene) with exon 


025 


335189 






CH22JGENES,507_4 


025 
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Table 17: B survivor vs Mets - Up in B survivor 



Pkey: Unique Eos probesct identifier number 
ExAccn: Exemplar Accession number, Genbank accession number 



UnigenelD: Unigene number 
Unigenc Titter Unigene gene title 



Pkey ExAccn UnlGJD Complete Tttle Ratio BS/Met 

101006 J04132 Hs.97087 CD3Z antigen; zeta polypeptide (TTT3 com 728 

114173 Z39Q50 Hs.21963 ESTs 6.13 

130284 X82206 Hs.153951 ARP1 (acfcweteted protein 1; yeast) ho 5.77 

100787 HG36724fT4142 ImmunogtabuDn Gamma Heavy Chain, V(6)Djc Regions (Gb:U13200) 5.63 

132461 AA405775 Hs.49005 hypothetical protein 5.62 

133806 M12759 Hs.76325 Human Ig J chain gene 5.46 

133747 D86972 Hs.75863 KIAA0218 gene product 5.45 

123328 AA49B968 Hs.105403 EST 5.28 

132671 X76302 Hs54649 putative nucleic add binding protein RY 525 

132018 AA293194 HsJ737 ESTs 5.22 

100186 D17516 Hs.4748 adenylate cyclase activating polypeptide 5.14 

107155 AA621202 Hs.7946 DKFZP586D1 51 9 protein 5.1 

103566 Z22555 Hs.180616 CD35 antigen (collagen type I receptor, 5.06 

113355 T79203 Hs.14480 ESTs 4.99 

129040 U38864 Hs.108139 zinc finger protein 212 4.96 

130214 H78003 Hs.15266 ESTs 4.93 

129550 AA48Q991 Hs.113025 ESTs 4.92 

129704 W81301 Hs.12054 ubiquffin specific protease 22 4.91 

116425 AA609574 Hs.51483 ESTs 4.77 

105166 AA179767 Hs.30570 porygWamine binding protein 1 4.65 

118765 N74442 Hs.183696 ESTs 4.6 

108999 AA156064 Hs.72115 ESTs 457 

112756 R93908 Hs35258 ESTs 454 

111655 R16884 Hs.187462 ESTs 4.48 

119392 T90672 Hs238859 ESTs 4.42 

131957 AA609008 Hs.183232 ESTs 4.41 

129275 082061 Hs.109993 Ke6 gene; mouse; human homolog of 4.4 

113634 T95085 Hs.125182 ESTs 4.4 

127187 AA29713B Hs207422 ESTs 4.32 

101147 L13266 Hs.105 gtirtemate receptor, ionotrapic; N-methyl 4.3 

134901 S78873 Hs.90875 RAB interacting factor 4.26 

100896 HG4593-HT4998 Sodium Channel 1 424 

100687 HG311&HT3291 GoUMbp (Gbl18862) 421 

129758 AA599552 Hs.183770 Homo sapiens mRNA; cDNA DKFZp566P2346 (f 4.19 

105440 AA252243 Hs22851 ESTs 4.16 

131551 AA127867 Hs2B608 ESTs 4.15 

113761 T99373 Hs.189786 ESTs 4.09 

105897 AA401091 ESTs 4.07 

129495 AA382529 Hs239676 ESTs 4.06 

103436 X98206 H^apiens mRNA for UV-B repressed sequen 4.03 

104673 AA007633 Hs20010 ESTs 4.03 

128886 L36720 Hs.106880 bystin-Uke 4.02 

100702 HG3236-HT3413 Neurofibromatosis 2 Tumor Suppressor (Gbl27065) 3.99 

123547 AA608820 Ks.124085 WAA0921 protein 358 

134877 AA455241 Hs.90527 ESTs 3.97 

123650 AA509332 Hs.180696 ESTs 3.94 

106482 AA451672 Hs.108824 ESTs; Weakly similar to cDNA EST yk41 5c1 3.94 



101909 S69265 Homo sapiens mRNA tor PLE21 protein; com 3.93 

1 08390 AA075070 zm86b6.s1 Stratagene ovarian cancer (#93 

LYMPHOCYTE ANTIGEN LY-6A2/LY-6E1 PREC 3.93 



135403 U06643 Hs.99923 lectin; gabctoside-blnding; soluble; 7 3.89 

121038 AA398536 H&07365 ESTs 3.88 

128496 T83496 Hs.100610 ESTs 3.86 

108785 AA128946 ESTs 3.86 

119838 W79499 Hs.58580 ESTs 3.85 

130109 L12060 Hs.1497 retinotc add receptor; gamma 3.84 

134538 U79288 Hs.85053 WAA0513 gene product 3.83 

110310 H38209 Hs.3272B EST 3.81 

110433 H49425 Hs.32992 ESTs 3.78 

111834 R36138 Hs.152458 ESTs 3.76 

130903 N27086 Hs21068 ESTs 3.74 



105142 AA164851 Hs.15380 ESTs; Weakly similar to HERV-E envelope 3.73 
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130248 U84569 Hs.153452 chromosome 21 open reading frame 2 3.73 

130645 AA020942 Hs.17200 STAM-Gke protein containing SH3 and ITA 3.73 

123378 AA521043 Hs.185832 ESTs 3.73 

103985 AA313880 EST185737 Colon carcinoma (HCC) ceil lin 3.73 

112397 R60822 Hs.26805 EST 3.72 

100980 J03069 Hs.72931 v-myc avian myefocytomatosls viral onoog 3.72 

102609 U64863 Hs.158297 programmed eel) (team 1 3.7 

108974 AA151402 Hs.46531 ESTs 3.7 

130192 Y12661 Hs.171 014 VGF nerve growth factor inducible . 3.69 

131318 X51699 Hs.2558 bone gan^na-cartoxyglutamate (gla) prate) 3.68 

113759 T99364 Hs.16074 Homo sapiens mRNA; cONA DKFZp564l153 (fr 3.66 

133712 L19267 Hs.198835 dystrophia myotontca-containing WD repea 3.65 

134229 R15108 Hs.8037 ESTs 3.65 

134241 AA300265 Hs. 80540 K1AA0195 gene product 3.65 

124699 R06413 Hs.112278 arrestin; beta 1 3.62 

107343 U03115 Hs.103945 Human V beta T-ceil receptor (TCRBV) gen 3.62 

128511 AA425636 Hs. 10082 potassium intermedlate/smaD conductance 3.62 

105466 AA253412 Hs.21489 ESTs 3.61 

131377 R41389 Hs.26159 ESTs 3.6 

119135 R49548 Hs.169681 death effector domalrwontaining 3.6 

132982 L02326 Hs.198118 immunoglobulin bmbcMce polypeptide 2 359 

128514 H84261 Hs.100843 ESTs; Weakly similar to similar to GTP-b 3.56 

102396 U41804 Hs54411 putative T1/ST2 receptor binding protein 3.55 

134945 R50247 Hs.91600 ESTs 3.55 

134913 X60483 Hs.91031 H4 rtistone family; member D 3.54 

102053 U07664 Hs.37035 homeoboxHB9 352 

121569 AA412686 Hs.97955 ESTs 352 

132560 AAQ05315 Hs.204524 ESTs; Weakly similar to WAA0747 protein 351 

118456 N66580 Hs.161496 EST; Weakly similar to HC1 ORF [Mmuscul 3.51 

111516 R08160 Hs.222529 ESTs; Weakly similar to ilii ALU SUBFAMI 3.51 

116795 H38858 Hs.251783 EST 35 

130377 AA378316 Hs.155182 K1AA1036 protein 3.5 

121774 AA421758 Hs.98361 ESTs 3.49 

123413 AA521446 Hs.103845 ESTs 3.49 

133798 AA444115 Hs.76277 ESTs; Weakly simitar to salivary profine 3.49 

135183 X939S5 Hs.239663 myeloid/lympnoid or rmxecMlneage leukem 3.48 

132479 AA477715 Hs.4953 gojgi autoantigen; goigin subfamily a; 3 3.47 

117191 H99394 Hs.40339 EST 3.47 

130942 X87852 Hs.21432 H^apiens mRNA for SEX gene 3.46 

130700 D55696 Hs.18069 protease; cysteine; 1 (tegumain) 3.43 

131301 T17386 Hs.164501 ESTs 3.43 

100818 HG4018-HT4288 Qploid-Bhiding CeD Adhesion Molecule 3.43 

103393 X94612 Hs.41749 protein kinase; cGMP^pendent; type II 3.43 

131337 AA228116 Hs.170204 KIAA0551 protein 3.42 

133403 X68688 Hs.72991 zinc finger protein 33b (KOX 31) 3.42 

124728 R16231 Hs.106620 Homo sapiens done 23950 mRNA sequence 3.41 

123168 AA488881 Hs. 10521 8 EST 3.39 

123324 AA496932 Hs.105399 KIAA0809 protein 3.38 

106947 AA496685 Hs.37936 suppressor of variegation 33 fJDrosophil 3.38 

116717 F11065 Hs.79363 ESTs 336 

102794 U88629 Hs.173334 ELL-RELATED RNA POLYMERASE II; ELONGATIO 334 

117503 N31963 Hs.44286 ESTs 3.33 

112220 R50295 H&25703 ESTs 3.33 

106340 AA441792 Hs.22857 chord domain-containing protein 1 333 

106308 AA436186 H&30662 ESTs 3.32 

130894 D16105 Hs.210 leukocyte tyrosine kinase 3.31 

120039 W92548 Hs.94985 ESTs 331 

131428 U17838 Hs^6719 PR domain containing 2; with ZNF domain 3.3 

113285 T66830 Hs.182712 ESTs 33 

109458 AA232648 Hs37068 ESTs 329 

132134 AA242904 Hs.40637 prollne-rfch Gia (G<arbo*ygiutamic acid 3.29 

118964 N93330 Hs54937 Homo sapiens clone 24722 unknown mRNA; p 3.29 

127621 A1216205 Hs.116204 ESTs 3.29 

135149 U40002 Hs.95351 lipase; rwrmone-sensitive 3.28 

114371 Z41835 Hs.27810 ESTs 3.28 

130043 AA055404 Hs.193953 ESTs; Weakly similar to III) ALU SUBFAMI 3.27 

121347 AA405181 Hs.97972 ESTs 3.25 

105754 AA302657 Hs.192028 ESTs 3.25 

121327 AA404286 Hs.173125 peptldytprolyl isomerase F (cydophlfln 325 

111204 N68295 Hs37982 ESTs 325 

120949 AA397830 Hs.98347 ESTs; Weakly similar to GUOMA PATHOGENE 3.25 

130024 U15197 Hs.241560 Human hteto-bbod group ABO protein mRNA 3.24 

125005 T61449 Hs.193727 ESTs 3.24 

121067 AA398662 Hs.97302 ESTs 324 

120996 AA398281 Hs.143684 ESTs 323 

117101 H94043 Hs.24341 DKFZP586I1419 protein 323 
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130708 U40490 Hs.18136 nicotinamide nucleotide transhydrogenase 323 



130270 L40399 Hs.153820 hypothetical protein 322 

131605 AA256220 Hs29383 ESTs 322 

100854 H6419WT4464 SodiunVHydrogan Exchanger 5 322 

123026 AA481072 Hs.99743 ESTs 321 



108328 AA070204 zm68b3^1 Stratagene neuraepWum (#9 32 



104259 AF007833 Hs.159265 Homo sapiens kruppekelated zinc finger 32 

133711 J04130 Hs.75703 smafllntodbtecytoW^ 32 ' 

112261 R52145 Hs.25694 ESTs; Highly simiter to hypothetical pro 3.19 

119529 W38053 Accession not feted In Genbank 3.19 

122386 AA446221 Hs.6092 F-box protein containing leuctoe-rlch re 3.19 

109157 AA179161 Hs.73562 ESTs 3.19 

119903 W85707 Hs.75936 erythrocyte membrane protein band 43 (d 3.18 
127452 AA491317 aa65c01.r1 NCLCGAP.6CB1 Homo sapiens cD 3.16 

124229 H62793 Hs.221892 ESTs 3.18 

129221 AA417126 Hs.109571 trartslocase of inner mitochondrial membr 3.17 

133185 AA481404 Hs.6686 - ESTs 3.16 

121479 AM11911 Hs.98110 ESTs 3,16 

133872 T79868 Hs.180903 hypothetical protein 3.16 

132504. U12897 Hs.5022 imprinted in Prader-Willi syndrome 3.16 

103089 X60382 Hs.179729 collagen; type X; alpha 1 (Schmid metaph 3.15 

129654 AA019943 Hs.118463 H^apiens mRNA for unknown Ever orphan 3.15 

117295 N22360 Hs.43153 ESTs 3.15 

107349 U48224 Hs.158321 beaded fBament structural protein 2; ph 3.14 

103451 X99459 Hs.154782 adaptor-related protein complex 3; stgma 3.14 

114854 AA235056 Hs.120244 ESTs 3.14 

121044 AA398551 Hs.97374 ESTs 3.13 

128582 U22963 Hs.101840 major histocompaffl)nitya)mplex;ctess 3.13 

112598 R78565 Hs.138395 EST 3.13 

113170 T54342 Hs222506 ESTs 3.13 

111714 R23146 Hs23466 ESTs 3.13 

111609 R33616 HS24688 EST , 3.12 

115249 AA278961 Hs.71124 ESTs 3.11 

103228 X75546 H&230 fibrortodulln 3.11 

129944 L00389 Hs.1361 cytochrome P450; subfamily I (aromatic c 3.11 

107927 AA028915 Hs237709 EST 3.11 

130297 H94949 Hs.171955 trophinirwssisting protein (tastin) 3.1 

125742 H81181 Hs.183654 ESTs; Weakly similar to unknown [S.cerev 3.1 

134802 L35546 Hs.89709 giutamate-cysteine ligase (gamma-gtutarny 3.1 



112560 R72293 Hs.6179 Homo sapiens mRNA; cDNA DKF2p586K2322 (f 3.1 



129266 AA343881 Hs, 20 9061 sudD (suppressor of Wnfl6; Aspergillus n 3.09 

126982 AA211419 small inducible cytokine A5 (RANTES) 3.09 

131594 H29723 Hs29261 ESTs; Weakly similar to serine protease 3.08 

134910 AA431320 Hs.9100 ESTs 3.08 

103505 Y09912 Hs.33102 transcription fectorAP-2 beta (activatr 3.08 

110525 H57330 Hs.37430 EST 3.07 

123276 AA491270 Hs.187946 ESTs 3.06 

130519 H91819 Hs.10669 ESTs; Moderately similar to K1AA0400 [H. 3.06 
126621 AA192638 zq01h08.r1 Stratagene muscle 937209 Homo 3.05 

134327 AF006041 Hs.178743 deatrwssoclated protBin 6 3.04 

103513 Y10209 H^apiens mRNA for CD3L protein 3.04 

131243 R16667 Hs.24752 spectrin SH3 domain binding protein 1 3.04 

115167 AA261805 Hs.44021 ESTs 3.04 
107543 Z43703 Hs.4552 Homo sapiens HR1HFB2157 mRNA; partial cd3.04 

134051 S67070 Hs.78846 heat shock 27kD protein 2 3.04 

113461 T88737 Hs. 193536 ESTs 3.03 

130490 X57522 Hs. 158164 ATP-binding cassette; suWamDy B (MDR/ 3.03 

128843 AA234141 Hs203004 katanin p80(WD40<ontaining)subunitB 3.03 

100941 HG862-HT862 Transition Protein 2 3.03 

122268 AA436855 Hs.1782Q2 ESTs 3.02 

107425 W26719 Hs.30204 ESTs 3.02 

130930 U19261 TNF receptor-associated factor 1 3.02 

132958 W90398 Hs.6147 WAA1075 protein 3.02 

100973 J02888 Hs.73956 MAD(P)H menadione oxidoreductase 2; dtox 3.01 

104924 AA058532 Hs28774 ESTs 3.01 

129998 Y10055 Hs.162808 phosphoinositide^kirtase; catalytic; de 3.01 

130023 X13461 Hs239600 caimoduTin-Eke 3 3.01 

129536 M33493 Hs. 184504 tryptase; alpha 3 

112015 R42836 Hs23198 ESTs 3 

103036 X54925 Hs.83169 matrix metafloproteinase 1 (interstitial 2.99 

100756 HG35654fT3768 Zinc Finger Protein (Gb:M88357) 2.99 

103425 X97301 H.sapiens mRNA for Ptg-11 protein 2.99 

118291 N63076 Hs.138746 EST 2.98 
125877 H15229 ym30g04.r1 Scares infant brain 1MB Homo 

repetitive element ;, mRNA sequence. 2.98 

101371 M13232 Hs.36989 coagulation factor VII (serum prothrombi 2.98 
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102958 X15675 Hs.93174 Human endogenous retrovirus pHE1 (ERV9) 297 

121183 AA400138 Hs.97703 ESTs 2.97 

119241 T12559 Hs.221382 ESTs 2.96 

115067 AA253458 Hs.91299 postmelotic segregation increased 2-Gke 2.96 

126196 AA084394 znQ5g10.s1 Stratagene hNT neuron (#93723 296 

111642 R16153 Hs.128740 ESTs; Highly similar to DNb-5 [H.saptens 2.95 

100898 HG4638-HT5050 SpBceosomal Protein Sap 49 2.95 

129370 AA287879 Hs.110798 ESTs; Moderately simflar to GTP-blnding 2.94 

128915 C02386 Hs.107139 ESTs 294 

101868 M96233 Hs.82891 glutathione S-transterase M4 2.94 

124394 N29724 gamma2-adaptin 193 

103559 Z19585 Hs.75774 thrombospondin4 2.93 

107882 AA025630 Hs.17801 ESTs; Moderately similar to serine/proii 2.93 

134919 T99639 Hs.91142 KH-type splicing regulatory protein (FUS 2.92 

110293 H30258 Hs.37165 cotegen;typelX;alpha2 292 

132433 AA082546 Hs.48516 ESTs 292 

127347 AA428350 ESTs 252 

121976 AA429807 Hs£8632 ESTs . 2.91 

133025 AA135492 Hs.6318 ESTs; Highly similar to peroxisomal shor 251 

133413 S72043 Hs.73133 metaJlofttonein 3 (growth Inhibitory fee 2.91 

111694 R22035 H&23331 ESTs 2.91 

128369 F12681 H&205300 ESTs 2.9 

102464 U49260 Hs.3828 mevalonatB (diphospho) decarboxylase 29 

135358 C21431 Hs.99486 ESTs; Weakry similar to araiarl [H.sapie 29 

108661 AA113287 Hs.65905 ESTs; Weakly similar to PTB-ASSOCIATED S2.9 

102185 U20230 Human guanyl cyclase C gene, partial ods 289 

122071 M431787 Hs58762 EST 289 

102040 U06088 Hs.159479 galactosamine (N-acetyH^ulfate sulfa 289 

115689 AA410645 Hs.199014 ESTs 2.88 

1351 10 T15817 Hs.193788 nitric oxide synthase 2A (inducible; hep 2,88 

118729 N73717 Hs.161526 EST 2.88 

129518 AA369807 Ks.112238 ESTs 2.88 

125788 R74309 Hs.44499 smaO EDRK-rfch factor 2 2.87 

128650 U57971 Hs.103124 ATPase; Ca+^- transporting; plasma membra 287 

125936 H30751 Hs. 182859 lifeguard 287 

100779 HG3731-HT4001 Immunoglobulin Heavy Chain, Vdjrc Regions (Gb:L23556) 287 

104451 M13299 Hs.1021 19 blue cone pigment 2.86 

133539 M21574 Hs.74615 platelet-derived growth factor receptor; 2.86 

119506 W37833 Hs.55563 ESTs 2.86 

126568 AA190515 zp85d1 2 j1 Stratagene HeLa cell s3 93721 286 

134184 X53742 Hs.79732 fibuUnl 286 

127633 A1339609 Hs. 152733 potassium voltage-gated channel; Isk-rel 286 

128716 AA045978 Hs.173611 NADH dehydrogenase (ubiquinone) Fe-S pro 2.86 

107135 AA620782 Hs.23247 ESTs 285 

117748 N47317 Hs.141858 ESTs 2.85 

124030 F04143 Hs.151032 Homo sapiens done 23856 unknown mRNA; p 285 

135120 AA449841 Hs.1083Q0 NOT3 (negative regulator of transcriptio 2.84 

102156 U17977 HSU17977 Humn fibroblast cDNA H sapiens 284 

129418 AA401401 Hs.11127 PET112freastlwmologHike 284 

103222 X74795 Hs.77171 minichromosome maintenance deficient {S. 284 

125145 W38001 Accession not listed in Genbank 263 

100560 HG2228-UT2305 Crystaflin, Beta B 283 

105370 AA236476 Hs22791 ESTs; Weakly similar to transmembrane pr 283 

127036 AH68598 ESTs 283 

128788 AA029073 Hs.105685 ESTs 2.83 

119523 W38041 ' Accession not fisted in Genbank 282 

126436 N31224 Hs.211579 melanoma adhesion molecuia 282 

126559 R15866 Hs. 170263 tumor protein 53-binding protein; 1 282 

118183 N59287 Hs.48361 EST 282 

101298 140387 Hs. 11 8633 Z-S'oSgoadenytate synthetase-Dke 281 

131830 U33054 Hs.32959 G protein-coupled receptor kinase 2 (Dro 281 

124173 H41281 Hs.107619 ESTs 2.81 

102295 U32581 Homo sapiens KIAA0421 mRNA; partJai cds 2.81 

129719 N66396 Hs.167766 ESTs;MotorateJysrmilaTtoPR>a2(X0l 281 

126573 AA482023 Hs.155218 E1B-55kDa^ssociated protein 5 281 

125477 AI270093 Hs234642 aquaporin3 261 

106492 AA451696 Hs.7922 ESTs; Weakry similar to contains simflar 

p19; an RNA polymerase II elongation fa 2.8 

132881 T86118 Hs.58875 ESTs 28 

114733 AA133778 Hs35734 ESTs 279 

104616 AA001611 H&186494 ESTs 279 

134137 F10045 Hs.79347 K1AA0211 gene product 2.79 

133212 U82979 Hs.67846 leukocyte Ig-fike recepton subfamBy B 2.78 
100882 HG4460-HT4729 Immunoglobulin Heavy Chain, Vojc Regions (Gbl23564) 278 

104756 AA024622 Hs.15813 solute carrier family 22 (organic cation 278 

129861 N69507 Hs.129849 DKF2P564M182 protein 278 
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120824 AA347548 Hs.96876 ESTs 278 

100684 H63107-HT3283 Plasma Membrane Calcium Pump Hpmca2a 2.78 

121789 AA423970 Hs.178111 ESTs 2.78 

101647 M59941 Hs.118200 colony stimulating factor 2 receptor, be 2.78 

113722 T97957 Hs.202948 ESTs;WeaWysimlartoaltematfvelysp 2.77 

115107 AA256371 Hs.186645 ESTs 2.77 

111464 R05518 Hs.19521 ESTs 277 

108446 AA079120 zm95eU1 Stratagene colon HT29 (#937221 277 

123921 AA621329 Hs.250671 Hu DNA seq frm clone 1163J1 onchr22q13 

prot (similar to mouse Ceisrl; rat MEGF 277 

134445 M59488 Hs.83384 S100 calchmvbindlng protein; beta (neur 276 

114132 Z38688 H&24192 ESTs 276 

120500 AA256430 Hs.132525 ESTs 276 

101860 N85610 Hs.37165 collagen; type IX; alpha 2 276 

134430 H52105 Hs.8309 WAA0747 protein 276 

124152 H27216 Hs.107635 ESTs 276 

132268 AA058833 Hs.23445 ESTs; Weakly smlr to simBar to Mmuscu 276 

116257 AA481493 Hs.88537 ESTs 276 

102438 U46570 Hs.7733 tetratricopeptkle repeat domain 1 275 

122393 AA446334 Hs.99064 ESTs 275 

107653 AA010210 Hs.47041 ESTs 275 

123674 AA609473 Hs.105187 ESTs; Moderately similar to klnesin Oke 275 

129858 T66906 Hs.12970 ESTs 275 

130117 U06641 Hs.150207 HDP gtycosyftransferase 2 family; polype 275 

133464 M13982 Hs.73917 interieukm4 275 

127039 AA233366 Hs 256491 ESTs 274 

128318 AA418202 Hs.13810 ESTs 274 

123363 AA504818 Hs.171279 ESTs 274 

127654 AA649249 Hs.75640 natriuretic peptide precursor A 274 

132067 L20860 Hs.178382 glycoprotein !b (platelet); beta potypep 274 

125664 AA948416 Hs.25744 ESTs; WeaWy similar to Ydr412wp |S.cere 273 

132354 L05187 Hs.211913 small profine-ffch protein 1A 273 

101568 M33764 Hs.75212 ornithine decarboxylase 1 273 

101438 M20777 Hs.159263 Homo sapiens; a!pha-2 (VI) collagen 273 

116233 AA479082 Hs.61142 ESTs 273 

122194 AA435882 Hs.97531 ESTs . 272 

113995 W88466 Hs.22010 ESTs 272 

124251 H68286 Hs.107924 ESTs 271 

120583 AA281304 Hs.78614 complement component 1 ; q subcomponent b 271 

134958 U72507 Hs.234216 Human 40871 rnRNA partial sequence 2.71 

124280 H85835 Hs. 100058 dihydropyrtnidinase-like 4 271 

130113 M64673 Hs.1499 heat shock transcription factorl 271 

106588 AA456612 Hs.25682 ESTs; WeaWy smlr to PHOSPHATIDYLETHANOL 271 

132023 F01927 Hs.3743 ESTs; WeaWy similar to profine-rich pro 27 

112284 R53558 Hs.26052 ESTs 27 

107897 AA026240 Hs.61387 ESTs 2.7 

122610 AA453598 Hs.99336 ESTs 27 

119070 R2778B Hs.52302 ESTs 2.7 

103491 Y08836 Homo sapiens rnRNA for HRX-like protein 2.7 

108225 AA058843 Hs.161620 EST 27 

105829 AA398290 Hs.21965 ESTs 269 

127749 AI251757 Hs.145234 ESTs 269 

128428 A1185718 Hs. 143900 ESTs 2.69 

108409 AA075578 zm88h3.s1 Stratagene ovarian cancer (#93 269 

114739 AA134923 Hs.103833 ESTs; WeaWy similar to predicted using 268 

128821 D87002 Hs.135 multiple UnKBene matches 268 

107412 W26105 Hs.8961 ESTs 268 

1 17012 H85893 Hs.1943B7 ESTs; WeaWy similar to HI! ALU SUBFAMI 268 

135262 AA416551 Hs.9732 ESTs 268 

105367 AA236397 Hs.20304 ESTs 2.68 

134771 L13939 Hs.89576 adaptor-related protein complex 1; beta 2.68 

105036 AA128617 Hs.25549 ESTs 2.68 

125093 TB2930 Hs.186750 ESTs 268 

119340 T61899 Hs.90677 ESTs; Highly similar to CGI-82 protein [ 267 

132603 H62900 Hs.53066 hsp70-interacting protein 267 

113733 T98386 Hs.184548 ESTs 267 

123564 AA608902 Hs.112612 ESTs 266 

116059 AA454165 Hs.53455 ESTs 2.66 

125803 R79373 Hs.29852 ESTs 266 

123012 AA479962 Hs.139636 EST 266 

106080 AA418046 Hs.35124 ESTs 266 

128809 T59668 Hs.102267 lysyl oxidase 266 

104354 H08988 Hs.113759 ESTs 266 

107068 AA609028 Hs.8032 ESTs 265 

101418 M17754 Hs.1276 BN51 (BHK21) temperature sensitivity com 265 

135157 AA460138 Hs.95582 SRY (sex«ietermining region YHjox 20 265 
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123312 AA488258 Hs.99601 ESTs 2.65 

130034 C00350 Hs.14454 chromosome 2 open reading frame 1 2.65 

103897 AA248870 Hs.55058 ESTs 2.65 

117771 N47961 Hs.46794 ESTs 2.65 

109980 H09529 Hs.98693 DKFZP586J0917 protein 2.64 

121966 AA429653 Hs.96616 EST 164 

114233 Z39652 Hs.27457 ESTs 2.64 

129594 R70379 Hs.1 15396 Human germtme IgD chain gene; C-region; 2.63 

102319 U34587 Hs.66578 corticotropin releasing hormone receptor 2.63 

111700 R22212 Hs.23361 ESTs 2.63 

127365 AA001628 Hs.74335 heatsh«*90kD protein 1; beta 2.63 

104205 M496240 Hs.17270 DKFZP434C211 protein 2.63 

124559 N66223 H&135928 ESTs; Weakly similar to 111! ALU SUBFAMI 2.63 

106351 AA442772 Hs.191987 ESTs; Weakly similar to 1111 ALU SUBFAMI 2.63 

121903 AA427605 Hs.258742 rnyosin-binding protein C; cardiac 162 

116442 AA620310 Hs.184343 ESTs;Wea%sirnilar toKIAA0585pr(>tein 2.62 
127041 F06090 HSCOWG031 normalized Infant brain cONA H 2.62 

132860 U93049 Hs58435 FWbkiding protein (FYB-12071 30) 162 

131591 L22454 Hs.180Q69 nuclear respiratory factor 1 2.61 

118118 N56901 Hs.47995 ESTs 2.61 

134809 X52611 Hs.18387 transcription factor AP-2 alpha (activat 2.61 

117706 N45091 Hs.46472 ESTs 2.61 

127488 AA312179 Hs.178617 ESTs; WeaWy similar to CGM2 protein [ 2.61 

114691 AA235984 Hs.87469 ESTs 16 

116426 AA609668 Hs.71657 ESTs 2.6 

132589 AA432197 Hs.5260 ESTs; Weakly similar to CGW8 protein [ 2.6 

128410 AA452788 zx39g11.r1 Scares„totaLfetus_Nb2HF8_9w 2.6 

106081 AA418394 Hsi5354 ESTs 2.6 

129919 R02003 Hs.1 91208 ESTs; Weakly similar to weak similarity 159 

124672 R00307 Hs.188504 ESTs 259 

122758 AA459013 Hs.99742 X-ray repair complementing defective rep 259 

125656 AA040118 Hs.78687 neutral sprung omyeOnase(N-SMase)activ 259 

130052 J 00220 Hs.1 45288 Human !g active epsttonl 5" UT; V-CkJ re 259 

134876 U28055 Hs35Q826 macrophage stimulating; pseudogene 9 259 

131908 L05624 H&3446 rnHoge/MKfivated protein kinase kinase 159 

126470 AA843339 Hs.193168 ESTs; Weakly similar to CGI-52 protein [ 259 

132353 M31651 Hs.46319 sex hormone-binding globulin 258 

119588 W44559 Hs.142525 ESTs 258 

131757 D17532 Hs.316 DEAD/H (Asp-Glu^Ala-Asp/His) box polypep 258 

118114 N56875 Hs.143212 cystatin F (teukocystatin) 258 

128200 A1279952 Hs.158037 ESTs; Weakly similar to transcripfion re 258 

(31208 C14586 Hs.24220 Homo sapiens mRNA; cONA DKFZp566M051 (fr 

124721 R11131 Hs.154966 ESTs 257 

108706 AA121S20 Homo sapiens mRNA for WAA0842 protein; 257 

118831 N79592 Hs50838 ESTs 257 

115708 AA412212 Hs.44033 ESTs 257 

107233 D59322 Hs.22595 ESTs 257 

129559 AA234945 Hs.11360 ESTs 2.57 

(26953 AA743849 Hs.127286 ESTs 156 

108165 AA055221 Hs.63168 ESTs 256 

I04069 AA401547 Hs.172694 ESTs 256 

112146 R46512 Hs.25374 ESTs 256 

I08384 AA074891 Hs.124917 ESTs; Highly similar to WAA0838 protein 256 

(31779 R49047 Hs.179779 rfcosomal protein L37 256 

111829 R36070 Hs.25079 EST 255 

I03424 X97267 Hs. 155975 protein tyrosine phosphatase; receptor t 255 

100133 D13118 Hs.80986 ATP synthase; H+ transporting; mitochond 2.55 

1 30208 AA620556 Hs.1 5250 peroxisomal D3;D2-enoyl-CoA tsomerase 255 

I24649 N92593 Hs.102907 ESTs 255 

106511 AA452665 H&206713 UDP^atbetaGteNAc beta 1;4- galactosyl! 255 

(28467 AA176446 Hs.160428 EST s; WeaWy similar to hypothetical 43. 255 

113524 790072 Hs.15060 ESTs 155 

107821 AA020991 Hs.172856 ESTs 155 

111900 R39044 H&25318 Homo sapiens clone 25194 mRNA sequence 154 

109908 H05255 Hs.203237 EST . 154 

132069 D87454 Hs.192966 K1AAQ265 protein 254 

130660 T95262 Hs.17538 ESTs 254 

112983 T23443 Hs.7111 ESTs 154 

(26279 H08885 yI88b08.ii Soares infant brain 1NtB Homo 154 

108415 AA447994 H&29188 ESTs 153 

116741 H03268 Hs.181746 EST 253 

103148 X66362 Hs.2994 PCTAIRE protein kinase 3 153 

132336 AA342422 Hs.45073 ESTs 253 

129484 R92488 Hs.111989 ESTs 153 

110169 H19696 Hs.31612 ESTs; Moderately similar to CAGH4 [h\sap 153 

116880 H68380 Hs.144174 EST 253 
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133511 
126037 
132678 
128751 
133664 
126977 
120697 
128571 
104422 
122372 
112154 



X04106 
M85772 
AA599876 
AA442274 



AA309665 
AA291522 
AA416619 



115000 
110632 
129154 
107440 
105694 



134462 
101800 
119884 
110289 
125506 
102954 
127851 
126179 
129443 
104480 
115580 



AA446008 

R46769 

R16034 

AA251342 

H72344 

N23673 

W28069 

AA287109 

AA430388 

U11Q37 

M85276 

W81606 

H29829 

H54273 

X15393 

AI469331 

AI191445 

W69967 

N41486 

AA398695 



Hs.74451 

Hs.6Q66 

Hs.5486 

Hs.183176 

Hs.75445 

Hs.97250 

Hs.101661 

Hs.132909 

Hs.99044 

Hs.25388 

Hs.12701 

Hs.144584 

Hs.171635 

Hs.108969 

Hs.251993 

Hs.37883 

H&13144 

Hs.83620 

Hs.105806 

Hs58662 

Hs.31524 

Hs.154073 

Hs.2813 

Hs.130497 

Hs.143855 

Hs.111497 

Hs.99654 

Hs.144339 

Protl37A) 



calpatn; small polypeptide 
KIAA1112 protein 
ESTs 
ESTs 



2.53 
153 
2.53 
2.52 
2.52 
2.52 
2.52 
152 
252 
252 
2.52 
151 
151 
151 
251 
251 
151 



EST180547 JurtettT-ceOs V Homo sapiens 
EST 
ESTs 
ESTs 
EST 
ESTs 

ESTs; Highly simfer to piasmollpin [H.s 
ESTs 
ESTs 

mannosidase; alpha; class 2B; member 1 
ESTs; Weakly similar to similar to zinc 
ESTs 

ESTs; Weakly similar to ORFYGR038w[S.c 151 

seM (suppressor of Pn-12; Cetegans}- 251 

granutysin 251 

Homo sapiens mRNA; cONA DKF2p564G212 (fr 151 

ESTs 151 

UDP-garactose transporter related 151 

motffin 251 

ESTs; Weakly simaar to CHLORIDE CONDUCT 25 

ESTs; Highly similar to IROQUOIS-CLASS H 25 

ESTs; Moderately similar to neuronal pro 25 

protein-O^nannosyftransfiersse 1 15 

Hu DNAseqfrm done 495010 on chr 6q26- 

pseudogene; last exon of gene for a novel protsmlr to worm E04F6.2; ESTs; STSs and 



15 



119595 


W45031 


HS55878 


EST 


25 


103336 


X85785 


Hs.183 


Duffy blood group 


15 




U87964 


Hs^27576 


OTP binding protein 1 


149 


129643 


L27584 


Hs.250712 


calcium channel; voltage-dependent beta 


149 


134503 


U34880 


Hs54183 


dlptheria toxin resistance protein reqrd 


149 


117245 


N20989 


Hs.42927 


ESTs 


2.49 


126888 


H78745 


Hs.1063 


small nuclear ntonucteoproteln polypept 


149 


135313 


D63484 


Hs.98508 


K1AA0150 protein 


149 


121186 


AA400156 


Hs.183294 


ESTs 


149 


130651 


X04445 


Hs.1734 


inhlbbi; alpha 


2.49 


134218 


AA227480 


Hs.80205 


pim-2 oncogene 


149 


104008 


AA334630 




EST38874 Embryo, 9 week Homo sapiens cDN 


129705 


X78706 


H3.12D6B 


carnitine acetyl transferase 


149 


127900 


AI143912 


Hs.121824 


ESTs • 


149 


104609 


R96417 


Hs.107795 


ESTs 


2.48 


131628 


U47292 


Hs.2979 


trefoil factor 2 (spasmolytic protein 1) 


148 


132184 


U51003 


Hs.419 


cOstaHess homeobox2 


148 


130450 


U70735 


Hs.15591 


COP9 subunft6 (MOV34 homotog; 34 kD) 


148 


101679 


M62628 


Hs.163271 


Human alpha-1 Ig germfine C-region membr 


148 


120858 


AA350147 


Hs.96940 


EST 


2.48 


101012 


J04444 


Hs.697 


cytochrome c-1 


148 


110453 


H52133 


Hs.33026 


ESTs; WeaWy simfiar to similar to Enter 


148 


133771 


M68891 


Hs.760 


GATA-binding protein 2 


148 


102944 


X14445 


Hs.37092 


fibroblast grwth fctr 3 (murine mammary 


148 


113269 


T65159 


Hs.85044 


ESTs 


148 


107069 


AA609045 


Hs.11759 


ESTs; Weakly similar to ill! ALU CLASS B 


2.46 


100476 


HG1019+m(M9 


Serine Kinase Psk-H1 


2.47 


106457 


AA449718 


HS77801 


zinc finger protein 278 


2.47 


105718 


AA291629 


Hs.74335 


heat shock 90kD protein 1; beta 


147 


104925 


AA058683 


Hs.5548 


Homo sapiens clone 23765 mRNA sequence 


147 


109913 


H05527 


Hs.31588 


ESTs 


2.47 


103412 


X96698 


Hs.42957 


metfryltransferase-Dke 1 


2.47 


102326 


U35246 


Hs^26025 


vacuolar protein sorting 45A (yeast homo 


147 


116813 


H49911 


Hs.93102 


ESTs 


147 


123690 


AA609566 


Hs.112723 


EST 


147 


124714 


R09486 


Hs.193118 


ESTs 


2.47 


126154 


AI004105 


Hs.14232 


ESTs; Moderately similar to KIAA0563 pro 


2.47 


118880 


N90168 


Hs.54593 


EST 


147 


122274 


AA437094 


Hs.184456 


ESTs; Weakly similar to Ml ALU SUBFAM1 


2.46 


129600 


N76980 


Hs.11567 


ESTs; Moderately similar to unknown [H.s 


2.46 


121356 


AA405437 


Hs.93581 


Homo sapiens mRNA; cDNA DKFZp5B6E171 (fr 


109560 


F01778 


Hs.8154 


ESTs 


146 


123342 


AA504336 


HS31659 


thyroid hormone receptor-associated prot 


2.46 


128032 


AI150084 


Hs.126678 


ESTs 


146 


129101 


H90310 


Hs.108665 


ESTs; WeaWy slmflarto CELL-CYCLE NUCLE 



149 



2.46 



2.46 
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131185 M25753 Hs.23960 cycflnBI 2.46 

121451 AA411008 Hs.98085 EST Z46 

104328 D81932 HUM424C5B Hu fetal brain (TFujiwara) H s 2.46 

126543 AA723810 Hs.69517 ESTs; Highly simflarto differentially e 2.45 

123600 AA609106 Hs.112644 ESTs 2.45 

131020 AA411756 Hs.20594 ESTs; Weakly similar to misato p jnelano Z45 

134191 W2B902 Hs.7979 K1AA0735 gene product 2.45 

130446 X79510 Hs.155693 protein tyrosine phosphatase; non-recept 2.45 

131613 R88228 Hs.29595 JM4 protein 2.45 

118864 N89670 Hs.42148 ESTs; Weakly similar to Su(P) [D.melanog 2.45 

104232 AB002351 Hs.10587 KIAA0353 protein £45 

122604 AA453489 Hs.99333 ESTs 2.45 

120626 AA285064 Hs.104485 EST 2.45 

116655 F03866 Hs.68090 ESTs 2.44 

116267 AA485080 Hs.256539 ESTs Z44 

114944 AA243172 H&87619 TED protein 2.44 

127629 AA293279 Hs.29173 ESTs 2.44 

120350 AA211300 Hs.104166 ESTs 2.44 

103620 Z47Q87 Hs.182643 transcription elongation factor B (Si! I) 2.44 

131420 Z11737 Hs.2664 flavin containing monooxygenase 4 2.44 

131312 AA399226 Hs.25527 tlgrrt junction protein 3 (zona ocdudens 2.43 

122812 AA461044 Hs. 142980 EST 2.43 

135100 AA398926 Hs.251108 Homo sapiens mRNA; chromosome 1 specific 2.43 

113464 T86931 Hs.16295 ESTs Z43 

100045 M11507 AFFX control: transferrin receptor 2.43 

128975 AA092129 Hs.107538 ESTs; Moderately simtor to /prediction 2.43 

103688 AA011479 Hs.154701 ESTs 2.43 

127331 F20186 HSPD05873 HM3 Homo sapiens cDNA dons 05 2.43 

107337 T97111 Hs. 191 235 ESTs; Weakly similar to Ydr324cp [S.cere 2.43 

122171 AA435750 Hs.98830 EST 2.43 

107601 AA004636 Hs.50223 ESTs Z43 

119800 W73523 Hs.58314 ESTs 2.43 

104886 AA053348 Hs.144626 growth differenfetion factor 11 2.42 

122899 AA469960 Hs.178420 ESTs;H^hryslmaartoWA^frrtBracfinfl 2.42 

125933 AI308037 Hs.84120 ESTs; Weakly similar to nudeoporin p62 2.42 

121664 AA417291 Hs.97978 ESTs Z42 

125450 AA377194 Hs.238909 ESTs; Weakly similar to POLYPOSIS LOCUS Z42 

114611 AA081374 Hs.108110 DKFZP547E2110 protein 2.42 

111595 R11492 Hs.191225 ESTs 2.42 

111671 R19368 Hs.229084 EST 242 

110687 H93005 Hs.177311 ESTs Z42 

103019 X53414 Hs.144567 aianine-glyoxylate aminotransferase (oxa 2.42 

119076 R36634 Hs.235534 ESTs 2.42 

130589 AA234308 Ks.16441 DKFZP434H204 protein 2.42 

125975 AA495891 Hs.152290 ESTs; Highly similar to PACAP typ*3/VIP 2.42 

106380 AA446188 Hs.16614 ESTs 2.41 

121965 AA429652 Hs.104901 EST 2.41 

121604 AA416788 Hs38259 EST 2.41 

100885 HG4490KT4876 Prollne-Rich Protein Prb4, Allele 241 

117807 N487D1 Hs.46523 EST Z41 

119840 W79525 H&58586 ESTs Z41 

102458 U48861 Hs.54397 cholinergic receptor, nicotinic; beta po 2.41 

116152 AA460920 Hs215683 ESTs; Moderately similar to III! ALU SUB 2.41 

126741 AA522512 Hs.29759 Homo sapiens mRNA; cDNA DKFZp5B6L2123 (f 2.41 

103381 X92715 Ha3057 zinc finger protein 74 (Cos52) 2.41 

124837 R55630 Hs.233602 WAA0596 protein 2.41 

129322 AA437153 Hs.110407 ESTs; Weakly similar to coded for by C. 2.4 

129291 AA281930 Hs.110099 core-binding factor, nmt domain; alpha 2.4 

124789 R43803 Hs.78110 ESTs; Weakly similar to F1 7A9.2 fCefega Z4 ' 

133253 Y00970 Hs.183088 acrosin 2.4 

118990 N94447 Hs.55047 EST U 

134897 R71427 Hs.9081 phenytalanyt-tRNA synthetase beta-subunl Z4 

116572 D45654 Hs.65582 DKFZP586C1 324 protein Z4 

104294 D14539 Hs.234774 myeloid/lymphoid or mixed-lineage leukern 2.4 

116764 N74440 Ha.205264 ESTs 2.4 

117437 N27645 yw5e3.s1 Wefemarrn Olfactory EpHheGUm H 

3" similar to contains L1 .t3 L1 repetit - Z4 

111651 R16733 Hs.20499 ESTs 2.39 

109583 F02322 Hs.26135 ESTs 2.39 

125969 R94247 Hs.193879 ESTs Z39 

130647 AA457216 Hs214190 irrterieukin enhancer binding factor 1 2.39 

113708 T97467 Hs.18065 ESTs Z39 

133469 L03785 Hs.170482 myosin; light polypeptide 5; regulatory Z39 

116266 N62837 Hs.48647 lmmimogtobufirhBketrBnscript7 2.39 

121656 AA417246 Hs.98212 ESTs 2.39 

126530 A1422841 Hs.180086 ESTs Z39 
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123708 


AA609648 


Hs.207767 


EST 


249 


107875 


AA0253O6 


Ks.61162 


ESTs 


249 


111711 


R22891 


Hs.7093 


ESTs 


2.39 


131405 


U79255 


Hs.26468 


amyloid beta (A4) precursor protein-bind 


249 


127454 


AA502957 


Hs.153590 


ESTs 


249 


132341 


AA448419 


Hs.45209 


ESTs 


246 


133673 


D87673 


Hs.75486 


heat shock transcription factor 4 


248 


113213 


T58607 




ya94a02£l Stratagene placenta (#937225) 


246 


10S230 


AA429356 


Hs.12047 


ESTs 


248 


116692 


F09261 


Hs.66103 


ESTs 


248 


126197 


AA172284 


Hs.103657 


ESTs; Weakly simflar to CH-TOG PROTEIN [ 238 


115966 


AA446866 


Hs.71371 


ESTs 


2.38 


132636 


U65785 


Hs.5417 


oxygen regulated protein (150kD) 


248 


109965 


H09077 


Hs.30895 


EST 


138 


130203 


L14754 


Hs.1521 


immunoglobulin mu binding protein 2 


2.38 


131332 


R50487 


Hs.25717 


ESTs 


2.38 


119105 


R42357 


Hs.91453 


ESTs 


247 


129253 


W69316 


Hs.109778 


ESTs; Weakly similar to similar to beta- 


247 


113602 


T92558 


Hs.17036 


ESTs 


247 


118102 


N55272 


Hs.145798 


ESTs 


247 


100734 


HG3432-HT3620 


Fibroblast Growth Factor Receptor K-Sam. AIL Splice 3, K-Sam lil 


111533 


R08548 


Hs.251651 


EST 


247 


130813 


U12259 


Hs.198 


paired box gene 3 (Waardenburg syndrome 


247 


119180 


R80413 


HsJ9252Q 


ESTs 


247 


109335 


AA211443 


Hs.86492 


ESTs 


247 


107386 


U97698 


Hs.159593 


mucin 6; gastric 


246 


122486 


AA448328 


Hs.1 15527 


ESTs 


246 


112997 


T23548 


Hs.167467 


ESTs 


Z36 


109674 


F09051 


Hs.21837 


ESTs; Weakly simUarto KIAA0927 protein 


2.36 


128868 


AA423827 


Hs.106730 


hypothetical protein 


246 


127027 


R17261 




yg12g07.ii Scares Infant brain 1N1B H sa 


246 


123099 


AA485931 


Hs.79 


aminoacytase 1 


246 


115716 


AA416767 


Hs.43498 


ESTs; Weakly similar to ORF YKL201c [S.c 


246 


130830 


D86982 


Hs.20050 


K1AA0229 protein 


246 


109051 


AA159920 


Hs.72322 


ESTs 


246 


130181 


R39552 


Hs.151608 


Homo sapiens done 23622 mRNA sequence 


246 


131114 


R46233 


Hs^3107 


ESTs 


246 


123589 


AA609047 


Hs.188922 


ESTs 


246 


130872 


U03891 




phorbolin (similar to apoBpoprotein 8 m 


246 


131962 


H78550 


H&2780 


jun D proto-oncogene 


246 


130502 


M55067 


Hs.1583 


neutrophil cytosofc factor 1 (47kO; chr 


246 


121785 


AA423883 


Hs.142442 


ESTs 


245 


125405 


T97171 


Hs.121570 


ESTs 


245 


103682 


AA000993 




ESTs 


2.35 


125649 


T77395 


Hs.194816 


stomaWke protein 1 


Z35 


115452 


AA285019 


Hs.55263 


ESTs; Highly simflar to mitochondria] di 


245 


129338 


T56800 


Hs.47274 


Homo sapiens mRNA;cDMAOKF2p564B1 76 (fir J 


106105 


AA421268 


Hs.149443 


putative tumor suppressor 


245 


134770 


R72079 


Hs.89575 


CD79B antigen (irrummcglcoulirvassodated Z35 


119422 


T99496 


Ha .229598 


EST 


2.35 


109869 


H02849 


Hs.30345 


EST 


245 


134314 


AA263032 


Hs.81634 


ATP synthase; H+ transporting; mftochond 


245 


114989 


AA251097 


Hs.189119 


ESTs 


245 


122619 


AA453755 


Hs.191515 


ESTs 


245 


133129 


AA428580 


Hs.65551 


ESTs 


245 


128465 


AA416762 


Hs.100221 


nuclear receptor subfamily 1; group H; m 


245 


115636 


AA402715 


Hs.58389 


ESTs 


245 


130835 


J05068 


Hs.2012 


transcobaiamin I (vitamin B12 binding pr 


244 


132385 


Y10256 


Hs.47007 


serine/threonine protetn-kfnase 


234 


107776 


AA018820 


Hs.221147 


ESTs 


244 


109791 


F10669 


Hs.13228 


DRE-antagonist modulator catsenilin 


244 


124409 


N33212 


Hs.107197 


ESTs 


244 


131068 


AA397916 


Hs.22595 


ESTs 


244 


121079 


AA398719 


Hs.14169 


ESTs; Weakly similar to CREB-bindlng pro 


244 


124662 


N94340 


Hs.171835 


ESTs; Weakly smir to PUT PRE-MRNA SPUC! i 


133820 


M13666 


Hs.177582 


surfactant; pulmonary-associated protein 


244 


129424 


M55593 


Hs.1 11301 


matrix metaJioproteinase 2 (gelatinase A 


244 


109066 


AA161377 


Hs.72404 


EST 


244 


100339 


D63485 


Hs.181359 


KIAA0151 gene product 


2.34 


100809 


HG3991-HT4261 


Cpg-Enriched Dna, Clone E18 


234 


120844 


AA349417 Hs.96917 


ESTs 


243 


124927 


R96146 


Hs.221459 


ESTs 


243 


109779 


F10527 


Hs.3353 


Homo sapiens done 24940 mRNA sequence 


243 


101171 


L16842 


Hs.119251 


ubiquinokytochrome c reductase core pr 


243 


110805 


N26904 


Hs^4048 


ESTs; Weakly slmilarto l^flraparrrydn- 


243 


125440 


AI090982 


Hs.31895 


ESTs 


2.33 
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133159 AC000061 Hs.663 cystic fibrosis transmemb conductance re 2.33 

MB29 M91366 Hs.129763 solute carrier ramify 8 (sodium/calaum £33 

126492 AA778555 Hs.142505 ESTs 233 

102774 U83303 Hs.164021 small inducible cytokine subfamily B (CX 233 

130480 N50809 Hs.15760 ESTs; WeaWy similar to similar to Yeast 2.33 

126878 AW24759 Hs.238928 ESTs 2.33 

117338 N23889 Hs.43466 ESTs 2.32 

118662 N70877 Hs.13055 ESTs 232 

130354 AA416685 Hs.155001 UNC13 (C. e!egans)-fike 2.32 

106760 AA477330 Hs.12293 ESTs 2.32 

124294 H90573 Hs.102298 EST 2.32 

119428 W02129 Hs.55242 EST 2.32 

132629 Z40942 Hs.5383 ESTs 2.32 

127998 AA854161 Hs.143585 ESTs 2.32 
132728 AA293334 Hs.5566 ESTs; Highly similar to RAS-RELATED PROT232 

120292 AA189116 H&96168 ESTs 2.32 

107598 AA004528 Hs.169444 ESTs 2.32 

128164 AW78174 Hs.144846 ESTs 232 

105753 AA299789 Hs.15277 ESTs 2.31 

131256 AA262340 Hs24907 coronln; actin-binding protein; 2B 2.31 

110891 N38863 Hs234392 platelet-activating factor acetyfhydrola Z31 

116767 H13689 Hs£2530 ESTs 231 

100545 HG2147-HT2217 Mucin 3, Intestinal (Gb:M55405) 231 

125264 W88995 Hs.167641 ESTs;WeaWysIrnilartoC15H9i[C.e!ega 231 

118387 N64579 yz51d1U1 MortonFetal Cochlea Hsapien 231 

104335 D83847 Hs.183864 e!astase3B 231 

107464 W42944 Hs.171939 ESTs 231 

112304 R54798 Hs26239 ESTs 2.31 

134313 AA136100 Hs.6673 trinucleotide repeat containing 15 231 

116322 AA490900 HsJ8643 ESTs; Highly similar to JAK3B [Rsapiens 231 

111275 N70970 Hs35006 ESTs 2.31 

100109 AJ000480 Hs.143513 prtc^oproteinregubtedbymltogenicpa £31 

109338 AA211717 Hs.86507 ESTs 231 

134432 AA053022 Hs.8312 ESTs 231 
129649 AO0O0092 Hs.182628 Homo sapiens DNA from chr 19p13.2 cosmid 

EKLF; GCDH; CRTC; and RAD23A genes; gen 

122623 AA453990 Hs.99248 ESTs 231 

112070 R43976 Hs.236310 EST 231 

127683 AA668123 Hs.134170 ESTs 231 

104920 AA057620 Hs30807 ESTs; Highly similar to dJ1 8601,1 [H.sap 231 

106064 AM17373 Hs.15898 ESTs 2.31 

106782 AA478487 ESTs 231 

126709 AA028159 Hs.47234 ESTs 23 

105129 AA158386 Hs.186476 ESTs 23 

105719 AA291644 Hs36793 ESTs 23 

121698 AA418399 Hs.10351 K1AA0308 protein 23 

119069 R27619 Hs.231046 EST 23 

130388 U72515 Hs. 189583 putative protBin simfiar to nessy (Droso 23 

103444 X98801 Hs.74617 dynactinl (p1 50; Glued (Drosophtla) hom 23 
1 14604 AA076128 zm1Bg4.s1 Stratagene pancreas (#93728) H 

3' similar to SW:RS1A_HUMAN P3927 4S Rl 23 

103878 AA227635 H&202588 ESTs 23 

105828 AA398276 Hs.11962 ESTs 2.3 

119778 W72920 Hs.58244 ESTs 23 

120401 AA234309 Hs.193011 ESTs 2.3 

116290 AA488691 Hs.57969 pheny!a!anlne-(RNA synthetase 2.3 
130479 R44163 Hs.12457 Homo sapiens clone 23770 mRNA sequence 23 

104253 AF002672 Hs. 152944 bssof heterozygosity; 11; chromosomal 229 

132615 H66367 Hs.53358 ESTs; Weakly similar to MI ALU SUBFAMI 229 

121954 AA429598 Hs.98587 ESTs 229 

101336 L49169 Hs.75678 FBJ murine osteosarcoma viral oncogene h 2.29 

127247 AA313802 Hs.6289 growth factor receptor-bound protein 2 2.29 

117300 N22565 Hs.43212 ESTs 229 

122229 AA436198 Hs.103902 ESTs 23 

125105 T95766 Hs.189760 ESTs 2.29 

128083 R16100 Hs.166476 ESTs 229 

131279 AA089853 Hs.25197 STIP1 homology and U-Box containing prot 229 

133838 M97796 Hs.180919 Inhibitor of DNA binding % dominant neg 229 

111837 R36447 Hs24453 ESTs 229 

111435 R01620 Hs.19198 ESTs 229 

123613 AA609158 Hs.112656 EST 229 

133560 AA256365 Hs.7486 protein expressed In thyroid 229 

122896 AA469952 Hs.97899 ESTs; Weakly sirnSar to dal2;len343;C 229 

113378 T80627 Hs.14757 ESTs 229 

127174 AA293204 Hs.139352 ESTs 229 

120153 Z39582 Hs.65777 EST 229 
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112741 R93080 H&35035 ESTs 2.26 

132152 AA044784 Hs.4105 Homo sapiens mRNA; cDNA DKFZp586A0618 (f 228 

109790 F10665 Hs25031 ESTs 228 

113776 W04657 Hs^4248 ESTs 228 

102934 X13451 Hu mRNA for lymphocyte Bneage-cstrcted 228 

126168 AA322034 EST2469.0 Cerebellum II Homo sapiens cDNA228 

126363 N94706 Human Chromosome 16 BAC clone CfTWSK-A 228 

101427 M19508 Human myeloperoxidase gene, exons 1-4 228 

132616 AA386264 Hs.5337 Isodtrate dehydrogenase 2 (NADP+); mto 228 

105537 AA258813 Hs.27160 ESTs 228 

126527 AA548559 Hs.103853 ESTs 228 

115359 AA281936 Hs.88914 ESTs 228 

108474 AA079667 zm93d1.s1 Stratagene ovarian cncr (#9372 228 

120685 AA291066 Hs.105099 ESTs 228 

126171 AA704771 H&191942 ESTs 228 

112858 T02963 Hs.4454 ESTs 228 

121817 AM24826 Hs.98475 EST 228 

107695 AA026150 Hs.61384 ESTs 228 

131161 Z38223 Hs.23735 potassium voltage-gated channel; subfami 228 

135173 M72885 H&95910 Human G0S2 protein gene; complete cds 227 

103182 X69819 H&99995 intercellular adhesion molecule 3 227 

113889 W72720 Hs.194347 ESTs 227 

128984 AA319615 H&238030 secretory earner membrane protein 2 227 

101531 M29877 Hs.576 fucosldase; alpha-L-1; tissue 227 

115916 AA436889 H&91910 ESTs 227 

129892 H96850 Hs.89674 doQcnylndiphosohoollgosaccharia^protel 2^7 

103035 X54871 Hs.77690 RAB5B; member RAS oncogene family 227 

126479 T7B141 ESTs 227 

125778 R71976 Hs.161791 ESTs; WeaMy similar to ill I ALU SUBFAMI 227 

108132 AA053586 Hs.63048 ESTs 227 

111017 N53965 Hs256327 ESTs 227 

127165 AA359719 Hs.127121 ESTs 227 

126446 AI421309 Hs.1 18926 DKFZP586KD919 protein 226 

107864 AA025061 Hs.61246 ESTs 226 

122277 AA437133 Hs.98936 ESTs . 226 

115604 AA400378 Hs.49391 ESTs 226 

105061 AA134824 Hs.4865 ESTs 226 

118549 N68163 Hs.49455 EST 226 

110509 H56493 Hs.61960 ESTs; Moderately similar to HYPOTHETICAL 226 

114088 Z36280 Hs£6971 Human Chromosome 16 BAC clone CIT987SK-2 226 

103225 X74837 Hs.2750 mannosldase; alpha; class 1A; member 1 226 

125842 AA746654 Ks.5181 proBferaforvassociated 2G4; 38 kD 2.26 

104538 R25069 Hs.175681 ESTs 226 

130304 U09368 Hs.154205 zinc finger protein 140 {clone pHZ-39) 226 

120680 AA290743 Hs.97242 ESTs 226 

124062 H00440 Ks.144524 ESTs; Weakly similar to signal transduce 226 

103289 X80915 Hs.1573 growth differentiation factor 5 (car©a 226 

109286 AA197273 Hs.191324 ESTs 226 

128555 U62739 Hs.101408 branched chain aminotransferase 2; mitoc 226 

129439 AA171694 Hs.111461 ceruloplasmin (fenoxidase) 226 

109221 AA192755 H&85840 ESTs; Weakly simBar to stac [H sapiens] 226 

109906 H05084 r&28077 ESTs; Highly srmflar to GDP-mannose pyro 2.26 

130540 U35234 Hs.1 59534 protein tyrosine phosphatase; receptor t 226 

122870 AA465158 Hs.1 92861 Sp^B transcription factor (Spi-1 /P U. 1 r 226 

120219 241124 Hs.66045 EST 226 

128021 AI001135 Hs.78223 N^cylaminoacyl-peptide hydrolase 226 

121732 AA421047 Hsi8330 ESTs 226 

107817 AA020781 Hs.60847 ESTs 225 

101069 L02648 Hsi4232 transcobalamin 11; macrocytic anemia 225 

103065 X58399 Hs£1221 Human L2-9 transcript of unrearranged im 225 

118019 N52585 Hs.47517 ESTs 225 

122220 AA436011 Hs.98187 ESTs 225 

109161 AA179392 Hs.73601 EST 225 

128699 K03207 Hs.103972 proline-rich protein BstNi subfamily 4 225 

101914 S71824 Hs.167988 neural ceQ adhesion molecule 1 225 

102697 U74667 Hs.6364 Tat interactive protein (60kD) 225 

119939 W86753 Hs.82407 ESTs 225 

127793 AI298835 Hs.30445 ESTs; Weakly similar to transcription re 225 

104450 L77564 Hs.103976 serine/threonine kinase 22B (spermiogene 225 

133096 AA136042 Hs.131053 ESTs 225 

115416 AA283893 Hs.203866 ESTs 225 

117056 H90322 Hs.41357 EST 225 

115598 AA400129 Hs.65735 ESTs 225 

121267 AA401397 Hs.165296 ESTs; Highly similar to kalScrefn-Bke 225 

104778 AA026397 Hs.1 1039 Homo sapiens done 24804 mRNA sequence 225 

110926 N48252 Hs.135287 ESTs 224 
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102795 U88667 Ks.198396 ATP-blndlng cassette; sub-family A (ABC1 224 

118643 N70324 Hs.49840 ESTs 224 

103304 X82240 Hs2464 T-ceO teukemla/lymphoma 1A 2.24 

134814 Z46475 Hs.89771 glucokinase (hexoklnase 4) regulatory pr 2.24 

125912 AA171719 Hs.5233 eukaryotic translation initiation factor 124 

134385 R32377 Hs.82240 syntaxln3A 224 

117224 N20300 Hs.218707 ESTs 224 

107169 AA621601 Hs.184446 ESTs; Weakly similar to small GTP-bindin 224 

133948 M59916 Hs.77813 sphingomyelin phosphodiesterase 1;add 224 

101426 M19483 Hs25 ATP synthase; H+ transporting; mitochond 224 

119922 W86196 Hs.177384 ESTs 224 

123361 AA504810 Hs.139649 EST 224 

123915 AA621298 Hs.112967 ESTs 224 

123540 AA608792 Hs.112591 EST 224 

124978 T40560 Hs221759 ESTs 224 

102354 U38268 Human cytochrome b pseudogene, partial c 224 

124198 H53099 Hs.198271 NADH dehydrogenase (ubiquinone) 1 alpha 224 

102160 U18235 Hs.121561 ATP-biriding cassette; sub-family A (ABC1 2.24 

107520 X76091 Hs.100007 regulatory factor X; 2 {influences HLA c 224 

131589 U52100 Hs.29191 eplthefial membrane protein 2 224 

126633 AA206993 Hs.154145 guanine nud binding protein (G protein) 223 

130887 AA258379 Hs.155986 angiotensin receptor* 2 223 

119894 W84670 Hs.58518 EST 223 

124544 N63837 Hs.40500 similar to S.cere^siaeRERI 223 

103104 X61587 Hs.75082 ras homolog gene famSy; member G (mo G 223 

110119 H17306 Hs.177229 ESTs 223 

131411 AA464043 Hs26506 ESTs; WeaWy similar to NY-REN-45 antige 223 

102346 U 37359 Hs. 227297 metotic recomb^ation (S. cerevsiae) 11 223 

106003 AA411167 Hs.8734 ESTs; Moderately similar to !!!! ALU CLA 223 

122564 AA452251 Hs.98669 ESTs 223 

133688 U42031 Hs.7557 FK506-binding protein 5 223 

132096 AA131410 Hs.3964 Homo sapiens clone 24877 mRNA sequence 223 

110038 H11746 Hs31097 ESTs 223 

123788 AA620293 Hs.1 12853 ESTs 223 

135070 X99350 Hs.93974 forkheadboxJI 223 

104908 AA055841 Hs.154396 ESTs 222 

128674 AA025001 Hs.169452 ESTs 222 

100810 KG3992-HT4262 Cog-Enriched Dna, Clone E35 222 

120065 W93579 Hs.59478 EST 222 

122775 AA459592 Hs.112143 ESTs 222 

125443 H71482 Hs.177592 rfcosomal protein; large; PI 222 

118617 N69666 Hs.1 83413 ESTs; Moderately simitar to till ALU SUB 222 

128001 AI167814 Hs.166664 ESTs 222 

128160 AI279080 Hs.149971 ESTs; Moderately similar to !!!! ALU CLA 222 

106608 AA458644 Hs27115 ESTs 222 

103485 Y08409 Hs248415 thyroid hormone responsive SPOT14 (rat) 222 

135008 AA173423 Hs.92918 ESTs; WeaWy simflar to R07G3.8[C.elega 222 

110122 H17333 Hs.159837 EST 222 

128397 AJ393421 Hs.14032 ESTs 222 

110231 H24359 Hs28733 ESTs 222 

123188 AA489092 Hs.177726 ESTs 222 

131903 AA481723 Hs3436 deteted in oral cancer (mouse; homolog) 222 

122649 AA454616 Hs.90336 ATPase; H+ transporting; lysosomal (vacu 222 

133090 AA448228 Hs.6468 ESTs 222 

108002 AA037664 Hs.55067 ESTs; Weakly similar to T07F121 gene pr 222 

133120 X64559 Hs.65424 tetranectin (plasrninogen-binajng protein 221 

114263 Z40073 Hs.6045 ESTs 221 

125518 R20148 Hs.193851 ESTs 221 

128613 U78551 Hs.102482 Homo sapiens gallbladder mucin MUC5B mRN 221 

102773 U83192 Hs23731 decs; large (Drosophila) homolog 4 221 

119526 W38049 Accession not Bsted in Genbank 221 

126844 AA299325 EST1 1903 Uterus tumor I Homo sapiens cDN 221 

105860 AA399251 Hs 180933 ESTs; WeaWy similar to methyl-CpG facndi 221 

126957 AA733145 H&194560 ESTs 221 

108959 AA150107 Hs^1810 ESTs 22 

131663 AA423926 Hs.30318 ESTs 22 

127468 H02941 Hs.8888 ESTs 22 

104483 N42776 Hs.146233 ESTs 22 

123848 AA620773 Hs.221996 ESTs 22 

101623 M55905 Hs.75342 malic enzyme % NAD(+hdependent mftoch 22 

120872 AA357993 Hs.96996 ESTs 22 

135033 AA173241 Hs.93454 ESTs 22 

122286 AA437259 Hs.104944 EST 22 

114862 AA235174 Hs.50250 ESTs 22 



100255 D38047 Hs.78466 proteasome (prosome; rracropaln) 26S subu 22 
103063 X58234 Hs.123178 trarelocaseofmr»rmitochondn^lmembr 22 
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Human B-cell receptor associated protein 
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AA194266 


Hs.178393 


COTS 


O O 
LI 


122081 


AA431992 


Hs.104920 


ESTs 


2.19 


101798 


M85220 




Accession not listed In Genbank 


2.19 


111501 


R07444. 


Hs.163118 


ESTs 


2.19 


132832 


D63482 


Hs.57734 


WAA0148 gene product 


2.19 


100544 


HG2147-H72217 


Mucin3Jntestlnal(GbJ^55405) 


2.19 


106835 


AA482077 


Hs.33713 


ESTs; Weakly simitar to hypothetical pro 


2.19 


132934 


AA076145 


Hs.61053 


ESTs 


2.19 


108762 


AA127515 


Hs.71787 


ESTs; Highly similar to 30S ribosomal pr 


2.19 


120164 


Z39733 


Hs.158159 


FAT tumor suppressor (Drosophila) homolo 


Z19 


135395 


L08096 


Hs.99899 


tumor necrosis factor (tfgand) superfemi 


2.19 


101717 


M69013 


Hs.1686 


guanine nucleotide binding protein (G pr 


2.19 


121172 


AA400013 


Hs.97750 


EST 


2.18 


114861 


AA235123 


Hs.40719 


ESTs 


2.18 


120851 


AA349662 


Hs.174248 


ESTs 


2.18 


121083 


AA39B736 


Hs.97653 


EST 


2.18 


107171 


AA621624 


Hs.28038 


Homo sapiens done 24515 mRNA sequence 2.18 


128754 


D31446 


Hs.10488 


Breakpoint duster region protein; uteri 


218 


100149 


D13897 


Hs.169249 


peptide YY 


•2.18 


132405 


AA323787 


Hs.4770 


WAA1068 protein 


2.18 


114666 


AA1 12274 




zm27g6.s1 Stratagene pancreas (#93728) H 










etementcontalns element LTR8 repetifiv 


Z18 


127008 


AA223879 




zr10g05.rl Stratagene MT2 neuronal precu 


2.18 


110373 


H42896 


Hs.29438 


ESTs 


2.16 


119354 


T66942 


Hs.100651 


golgl SNAP receptor complex member 2 


2.18 


130115 


M31627 


Hs.149923 


X-box binding protein 1 


2.18 


130514 


M161085 


Hs.15871 


ESTs; Weakly similar to add phosphatase 


2.18 


128848 


H08077 


Hs.217179 


ESTs; Weakly similar to T27A1 .5 [Cetega 


2.18 


110161 


H19312 


Hs.28096 


ESTs 


2.18 


132367 


X82224 


Hs.46634 


cysteine conjugate-beta lyase; cytoplasm 


2.18 


125882 


H45538 


Hs.101448 


metastasis associated 1 


2.17 


113837 


W57698 


H&8888 


ESTs 


2.17 


106376 


AA444C04 


Hs.6084 


ESTs 


2.17 


113755 


T99075 


Hs.18570 


ESTs 


2.17 


107525 


X91817 


Hs.102866 


transketolasd-fike 1 


2.17 


119207 


R93186 


H&84298 


CD74 antigen (invar polypept of maj hist 


2.17 


131862 


AA236365 




3-phosphogtycerate dehydrogenase 


2.17 


115514 


AA297739 


Hs.55609 


ESTs; Weakly similar to ISOLEUCYL-TRNA S2.17 


112290 


R53940 


Hs^6016 


ESTs 


2.17 


126136 


H83353 




yv82fQ2.r1 Soares melanocyte 2NbHM Homo 2.17 


121574 


AA412712 


Hs.1 19325 


HuntingtirHUiteracting protein A 


2.17 


118530 


N67900 


Hs.1 18446 


ESTs 


2.16 


132327 


AA203285 


Hs.44892 


ESTs; Weakly simSar to dJ733D15.1 [H.sa 


2.16 


100564 


HG2239-HT2324 


Potassium Channel Protein (Gb21 1585} 


2.16 


129376 


AA022622 


Hs.13543 


ESTs; Weakly similar to hypothetical pro 


2.16 


135317 


X86012 


Hs.98602 


Human DNA sequence from intron 22 of the 










9.5kb repeated region; int22h-1; involv 


Z16 


114973 


AA250845 


Hs.87762 


ESTs 


2.16 


107559 


AA001504 


Hs.59860 


ESTs 


2.16 


111014 


N53787 


Hs.191117 


ESTs 


2.16 


101250 


L34060 


Hs.79133 


cadherin8 


2.16 


110697 


H93721 


Hs^)798 


ESTs 


2.16 


126843 


AA450166 


Hs.22641 


ESTs; Moderately similar to predicted pr 


2.16 


108272 


AA063616 


Hs.43773 


ESTs 


2.16 


125012 


T66935 


Hs.104859 


ESTs 


Z16 




R16101 


Hs.140834 


EST 


2.15 


123157 


AA488443 


Hs.100426 


DKF2P564A063 protein 


2.15 


102315 


U34252 


Hs.2533 


aldehyde dehydrogenase 9 (gamma-amlnobut2.15 


131897 


AA287623 


Hs.3426 


GTPase; human homotog of E. coli essenti 


2.15 


121528 


AA412253 


Hs.238909 


ESTs; Weakly simitar to POLYPOSIS LOCUS 2.15 


122806 


AA460707 


Hs.106397 


ESTs 


2.15 


125727 


H00958 


Hs.181641 


ESTs 


2.15 


133279 


AA069571 


Hs.6957 


Homo sapiens clone 24616 mRNA sequence 2.15 


103219 


X74570 


Hs.75268 




2.15 


120881 


AA362144 


Hs.104601 


EST 


2.15 


134080 


D42039 


Hs.78871 


WAAO0B1 protein 


115 


106598 


AA457140 


Hs.11411 


DKF2P566O084 protein 


2.15 
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125576 R66208 yi30h03.rl Scares placenta Nb2HP H sapie 

contains Atu repetitive etementcontain 2.15 

126727 AA037230 Hs.135084 cystatin C (amyloid angiopathy and cereb 2.15 

101490 M25629 Hs.123107 kaltain 1; renaVpancTBas/saDvaiy 2.15 

129708 AA417181 Hs.120858 ESTs Z14 

100627 HGZ702-HT2798 Serine/Threonine Kinase (Gb225424) 2.14 

121703 AA418671 Hs.104807 ESTs 2.14 
106809 AA479704 Hs220324 HumnDNAseqfrmdone283E3onchr1p36 

Female Reproductive tract M1FR1 ; -2; MM 2.14 
129525 F03873 Hs.112306 Homo sapiens clone 24955 mRNA sequence; 2.14 

100478 HG10674fT1067 Mucin (Gb:M22406) 2.14 

118593 N69020 Hs.207689 EST 2.14 
114047 W94427 Hs.3807 ESTs; WeaJdy similar to PHOSPHOLEMMAN PR Z14 

128823 AA478207 Hs.10632 ESTs; Moderately simSar to sex-deiermin 2.14 

100534 HG1980-HT2023 Tubulin, Beta 2 - 2.14 

105757 AA321146 Hs.30596 ESTs 2.14 

109617 F03192 Hs.26789 ESTs; Weakly similar to (1J162H14.1 [H.sa 2.14 

121547 AA412448 Hs.104777 ESTs 2.14 

119420 T98291 Hs.102484 glutathiones-transferase A3 2.14 
120274 AA177051 ncO2a0Zs1 NCLCGAP_pr3 Homo sapiens cDN 

repetitive element;contains element LTR 2.14 

132933 AA598702 Hs.6101 bone morphogenetic protein 6 Z14 

133405 X07881 Hs.73031 proBne-rich protein BstNl subfamily 3 2.14 

119811 W73922 Hs.49047 ESTs 2.14 

134536 AA457735 Hs.850 IMP (inosine monophosphate) dehydrogenas 2.14 

105125 AA157799 Hs.6980 aldo-keto reductase family 7; member A2 2.14 

101398 M15881 Hs.1137 uromoduiin (uromucoid; Tamm-Horsfall gly 2,14 

132751 AA397901 Hs£5993 ESTs 2.13 

115777 AA424142 H&39384 putative secreted Bgand homologous to f 2.13 

123193 AA489228 Hs.136956 ESTs 213 

116875 H67749 Hs.161022 EST 2.13 

107271 D60607 Hs.34931 EST 2.13 

134551 R44839 Hs.8526 hbeta-1;3^c«tylglucosammyitransfera Z13 

113413 T83739 Hs.186512 ESTs 2.13 

120522 AA258843 Hs.258748 ESTs 2.13 

. 119965 W87738 Hs.59039 EST 2.13 

131283 AA101601 Hs.183986 herpesvirus entiy mediator B (poxvirus 2.13 

107347 U43628 Hs.102598 mucosal vascular addressin ceil adhesion 2.13 

116490 C14265 Hs.66450 ESTs 2.13 

100563 HG2239-HT2324 Potassium Channel Protein (GbZ1 1585) Z13 

110441 H5O302 Hs.19845 ESTs; Highly similar to protein phosphat Z13 

101035 J05158 Hs.73858 carboxypeptidase N; polypeptide t, 83kD 2.13 

132500 AA047297 Hs.50107 ESTs; Moderately similar to CDO (Usapie 2.13 

129807 L34820 H&5299 aldehyde dehydrogenase 5 family; member 2.13 

106250 AA430466 Hs.28890 ESTs 2.13 

113569 T91086 Hs.162070 EST 2.13 

122911 AA470087 Hs.239726 ESTs Z13 

107452 W28988 Hs.250746 ESTs 2.12 

111824 R35661 Hs.25006 EST 2.12 

132831 U53442 Hs.57732 rrutogervacth/ated protein kinase 11 2.12 

110244 H26742 Hs.25367 ESTs; Weakly similar to ALR [H.saptens] 2.12 

128918 H85347 Hs.107164 spectrin; beta; ron^rythrocyticl Z12 

133728 M10901 Hs.75772 nuclear receptor subfamily 3; group C; m 2.12 

122476 AA448211 Hs.99164 ESTs Z12 

132004 137360 Hs.37054 ephrirhAS 2.12 

113971 W86760 Hs320682 ESTs Z12 

103386 X92972 rte.80324 protein phosphatase 6; catalytic subunit Z12 

131120 AA443676 Hs.23133 ESTs; Weakly similar to alcohol suHbtra 2.12 

102186 U20285 G protein pathway suppressor 1 2.12 

103694 AA018541 Hs.60580 zinc finger protein 2.12 

111995 R42333 Hs.20893 ESTs 2.12 

124436 N39596 Hs.182584 ESTs 2.12 

100306 D50495 Hs.80598 transcription elongation factor A (SIl); 2.12 

103034 X59932 Hs.77793 csrc tyrosine kinase 2.11 

115092 AA255903 Hs.80975 CD39-Iike4 2.11 

121579 AA416543 Hs.111981 ESTs 2.11 

127101 AI349351 Hs.118944 ESTs 2.11 

121195 AA400273 Hs.97791 ESTs 2.11 

112721 R91484 Hs.30853 ESTs 2.11 

113253 T64207 Hs.55296 HLA-B associated transcript-1 2.11 

120838 AA348887 HsJ6907 ESTs 2.11 

114122 Z38582 Hs.12751 ESTs 2.11 

112635 R82298 Hs^9497 ESTs 2.11 

103785 AA095600 Hs^25647 ESTs 2.11 
128260 AA331445 EST35277 Embryo, 8 week I Homo sapiens c 2.11 

122987 AA479155 Hs.103364 ESTs 2.11 
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110374 H42983 Hs227263 ESTs 2.11 

116595 D60625 Hs.177656 calmodulin 1 (phosphorylase kinase; delt Z11 

126117 H78617 yu26a08.r1 Scares fetal liver spleen 1NF 2.11 

116610 D80448 Hs.45177 ESTs 2.11 

111430 R01248 Hs.19165 ESTs 2.11 

106700 AA463929 H&28701 ESTs 2.11 

120181 Z40121 Hs.65870 ESTs; Weakly similarto Pro-PoWUTPase 2.1 

132545 AA147218 Hs.5105 ESTs 21 

105005 AA115253 Hs.28805 ESTs 2.1 

126702 U54602 Hs.2785 keratin 17 2.1 

124096 H10060 Hs.101687 EST 2.1 

132720 Z69881 Hs.5541 ATPase; Ca++transportrng; ubiquitous 2,1 

121926 AA428559 Hs.104895 ESTs 21 

125734 AA157445 H&227391 DKFZP547E1010 protein 2.1 

122368 AA443963 Hs.104S64 EST 2.1 

116910 H72014 Hs.161031 ESTs; Weakly simflar to SYNAPTGTAGMIN I 2.1 

113171 T54613 Hs.9761 EST 2.1 

134629 U00951 Hs.871 50 Human done A9A2BR1 1 {CAC)n/(GTG)n repea 2.1 

105712 AA291293 Hs.25219 ESTs 2.1 

106931 AA495918 Hjl26714 ESTs 2.1 

114278 Z40424 Hs.27728 ESTs 2.1 

116615 D80866 Hs.45203 ESTs 2.09 

100189 D21089 Hs.320 xeroderma pigmentosum; complementation g 2.09 

119500 W37694 Hs£5561 ESTs 2.09 

129605 S72493 Hs.115947 keratin 16 (focai non-epidermolytic palm 2.09 

133912 X62744 Hs.77522 major histocompatibffity complex; class 209 

129836 N34942 Hs.11782 ESTs 209 

106372 AA443941 Hs.4992 tumor suppressing subtransferabte candid 2.09 

101885 M98539 Hs.8272 prostaglandin D2 synthase (21 kD; brafn) 2.09 

132749 AA235989 Hs.55967 short stature homeobox 2 2.09 

135042 X91348 Hs.93522 putative norvcoding transcript (DiGeorge 2.09 

109404 AA224594 Hs.86941 ESTs 209 

101333 L47738 Hs.80313 p53 Inducible protein 209 

100114 D00596 Hs.82962 myrnidylate synthetase 2.09 

130536 T17045 Hs.159492 spastic ataxia of Chartevoix-Saguenay (s 2.09 

125772 R83903 Hs.78040 KDEL (Lys-Asp-Glu-Leu) endoplasmic retic 2.09 

132192 AA247569 Hs.4209 ESTs 2.09 

124697 R06273 Hs. 186467 ESTs; Moderately similarto lii! ALU SUB 209 

127694 A1247780 Hs.117036 ESTs 208 

127895 AA772600 Hs.187998 ESTs; Weakly similar to ATP-blndlng cass 208 

121315 AA402883 Hs.82269 progestagen-associated endometrial prate 

endometrial aIpha-2-globuiin; alpha ute 2.08 

112150 R46576 Hs 23239 ESTs 208 

105054 AA133584 Hs.26333 JM1 protein 208 

113151 T51620 Hs.9326 EST 208 

118783 N75285 Hs.50593 ESTs; Moderately similar to cytoplasmic . 2.08 

126748 AA249580 Hs.239975 ESTs; Moderately similar to CDO [Rsapie 2.08 

135160 U77643 Hs.95655 secreted and transmembrane 1 2.08 

107518 X60152 zinc finger protein 2 2.08 

126055 N28990 yx39g04.r1 Soares melanocyte 2NbHM Homo 2.08 

116982 H81933 Hs.40317 ESTs 2.08 

101756 M77235 Hs.169331 sodium channel; vcJtage^ated; type V; a 2.08 

116935 H75763 Hs.53468 ESTs 2.08 
118556 N68408 Hs.194637 Homo sapiens mRNA; cDNA DKFZp564D113 {fr 2.08 

129812 L07807 Hs.166161 dynaminl 2.08 

121946 AA429411 Hs.104888 ESTs 2.08 
133843 AA489045 Hs.76691 Homo sapiens done 25100 mRNA sequence; 208 

122170 AA435744 Hs.163913 ESTs 2.08 

122399 AA446449 Hs^31112 EST 2.08 

105775 AA348274 Hs.6664 ESTs 2.08 

123943 AA621553 Hs.112998 ESTs 2.08 

105771 AA347967 Hs£56267 neuroblastoma RAS viral (v-ras) oncogene 2.08 

114454 AA021091 Hs.226208 ESTs 108 

125802 R78652 Hs.151099 ESTs 2.08 

131556 AA442853 Hs.2869 cyciiiwlependent kinase 5; regulatory su 2.08 

118837 N79836 Hs^16338 ESTs 2.08 

107345 U26209 Hs.102307 solute carrier family 13 (sodiunvdepende 208 

131324 H58690 Hs.25625 ESTs 2.08 

105233 AA216759 Hs.191132 ESTs 2.07 

112886 T03864 Hs.7436 putative acyltransferase 2.07 

1202S2 AA169400 Hs.152701 DKFZP434F1 24 protein 2.07 

114867 AA235310 Hs.52899 ESTs; Moderately similar to 111! ALU SUB 2.07 
106715 AA464955 Hs.126062 ESTs; Weakly similar to EPIDERMAL GROWTH 2.07 

125560 R51281 Hs.13692 ESTs; Highly simflar to PROTEIN TSG24 [M 2.07 

112270 R53021 Hs.203358 ESTs 2.07 

134626 S82198 Hs.8709 caldecrfn (serum calcium decreasing fact 2.07 
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115723 AA417345 Hs£4846 ESTs 2.07 

123895 AA621192 Hs.112949 EST 2.07 

119906 W85816 ESTs; Moderate^ similar to Mil ALU SUB 2.07 

108559 AA085161 zn1 2c5.s1 Stratagene hNT neuron (#937233 

IMAGE^47283 , sim!!artoTR:G1151228G 207 

101246 L33799 Hs£02097 procollagen G«ndopeptldase enhancer 2.07 

100663 HG2915-HT3059 Mapr Hbtooompatibiiity Complex, Class I. E (Gb:M20022) 2.07 
114178 Z39063 Hs.17930 HurmDNAseqfrmdone1033B10onchr6p 

for GaiT3 (beta3-Galactosyftransterase) - 2.07 

125672 AA152281 Hs.78601 uroporphyrinogen decarboxylase 2.07 

118052 N53360 Hs.165133 ESTs 207 

102387 U41163 Hs.229731 solute carrier family 6 (neuratransmitte 207 

127305 AA535148 H1255277 ESTs 207 

101182 L19711 Hs.76111 dystroglycan 1 (o^slrophlrhassodatedg! 2.07 

131111 R33245 Hs33076 ESTs; Weakly similar to putative [Celeg 2.07 

112441 R63388 Hs^8412 ESTs 2.06 

117796 N48571 Hs.46689 EST 206 

116099 AA456309 Hsi8B31 regulator of Fas-induced apoptosis 2.06 

125559 AA307550 Hs.119571 collagen; type Hi; alpha 1 (Ehlers-Danl 2.06 

135271 AA397763 Hs.97562 ESTs 205 

106083 AA418545 Hs.31659 thyroid hormone receptor-associatfid prot 206 

133419 U67369 Hs.73172 growth factor independent 1 206 

127816 AA743646 Hs.120604 ESTs 206 

127502 AA614422 Hs.163502 ESTs 206 

129371 M10321 Hs. 110802 von Wiffebrand factor 206 
108417 AA075716 zm89e5.s1 Stratagene ovarian cancer (#93 

CLUSTER1N PRECURSOR (HUMAN);, mRNA sequ 205 

102837 U94585 Hs.13495 requiem; apoptosis response zinc finger 206 

124226 H62396 Hs,1 90266 ESTs 206 
102254 U28131 Human HMGI-C chimeric transcript mRNA, p 206 

128472 X87212 Hs.10029 cathepsinC 206 

107545 Z82022 Hs^6433 doOchyl-phosphate (UDP^cetylglucosam 2.06 

135311 M36089 Hs.98493 X-ray repair complementing defective rep 2.06 

121727 AA420973 Hs.104234 ESTs 2.06 

131846 U02619 Hs.331 general transcripfion factor IHC; polyp 2.06 

120415 AA235810 Hs.182522 ESTs 206 

110529 H57686 Hs.37488 ESTs 206 
104996 AA112307 Hs.105894 Homo sapiens mRNA; cDNA DKFZp434G231 (fr 206 

110351 H41222 Hs.196459 ESTs 206 

131261 AA223746 Hs. 171776 inosltd(myoH(or4)^nopnosphatase1 206 

110585 H62223 Hs. 133526 ESTs; Weakly similar to ill! ALU SUBFAMI 206 

129420 AA234259 Hs.99816 ESTs 206 
103796 AA112595 HsJ31146 Human DNA sequence from clone 1042K10 on 

lyase (EC 43.22; Adenylosuccinase; AS 

3). Contains ESTs; STSs; GS 206 

119782 W72982 Hs.58262 ESTs 206 

108641 AA112059 APsyrrthass;H+traiisport^ 206 

134875 U66672 Hs.180513 ATP-btoding cassette; siib-farnfly A (ABC1 206 

106832 AA482015 H&30114 ESTs; Highly similar to C8 p-Uapiens] 206 

109403 AA224413 Hs.86937 ESTs 206 

115485 AA287667 Hs. 188304 ESTs 206 

102923 X12517 Hs.1063 small nuclear rfoonudeoprotein potypept 2.06 

123320 AA495792 Hs.139572 EST 205 

111901 R39066 Hs.17638 ESTs 205 

106558 AA455111 Hs.182447 heterogeneous nudear ribonudeoprotetn 205 

126885 AA293052 Hs.10101 ESTs; Weakly similar to coded for by C. 2.05 

113429 T85190 Hs. 179808 ESTs 205 

102270 U30255 Hs.75888 phosphogluconate dehydrogenase 205 

103204 X72475 Hs. 192989 Ksaptens mRNA for rearranged Ig kappa I 205 

106666 AA461072 Hs37916 ESTs 205 

100947 HG907-HT907 Mg44 2.05 

102578 U60666 Hs.57693 teste specific leucine rich repeat prot 2.05 

105827 AA398255 Hs.31520 ESTs 205 

122324 AA442830 Hs.98921 EST 205 

101025 J04823 Hs.81097 cytochrome c oxidase subunit VIII 205 

115861 AA431768 Hs.90259 ESTs; Weakly similar to afoha 1 [Hsapie 2XK 

108081 AA045306 Hs.42996 ESTs 205 

133994 X74929 Hs.242463 keratin 8 205 

119131 R46700 Hs.129692 ESTs; Moderately similar to 111! ALU SUB 2.05 

129793 AA300151 Hs.126857 ESTs 205 

101653 M60284 Hs.1 61305 tachykinin receptor 2 205 

120300 AA191648 Hs.131476 ESTs 2.05 
106519 AA453415 Hs.8763 Hu DNA sequence from clone 889N15 on chr 

Thymocyte Marker CTX; the possibly alte 205 

114291 Z40690 Hs.123666 Homo sapiens mRNA full length insert cON 2.05 
105747 AA293719 Hs.30251 ESTs; Weakly similar to GLUCOSE^PHOSPH 204 
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125325 


AA332944 


Hs.8402 


adenylate cyclase 3 


ZQ4 


4 4 GOTO 

11 9978 


W88623 


HS.5919D 


EST 


o nil 


102449 


U48231 


Hs.46348 


bradyklnln receptor B1 


2.04 


101454 


M21812 


Hs.50889 


myosin light chain 2 


2.04 


116086 


AA455904 


Hs.86023 


ESTs 


O AA 

2.04 


102297 


U32674 


Hs. 198252 


G protein-coupled receptor 9 


2.04 


130889 


D57622 


H&20985 


sin3-associated polypeptide; 30kD 


1 f\A 

ZQ4 


100196 


D21853 


Hs.79768 


KIAA0111 gene product 


2.04 


120967 


AA398111 


Hs.97503 


ESTs 


2.04 


105735 


AA293096 


H&32417 


ESTs 


2.04 


135031 


R41604 


Hs.9344 


ESTs; Weakly simitar to Hi! ALU SUBFAMI 


2.04 


104882 


AA052954 


H&29546 


ESTs 


2.04 


132619 


AA404565 


H&53447 


ESTs; Moderately similar to Idnesin figh 


104 


127993 


AA847856 


Hs.124565 


ESTs 


Z04 


116441 


AA620299 


Hs.91696 


ESTs 


2.04 


102272 


U30610 


Hs.41682 


killer cell lectMce receptor subfami 


2.04 


119566 


W38209 




Accession not listed toGenbank 


2.04 


116622 


081171 


Hs.45208 


ESTs; Weakly similar to collagen type VI 


Z04 


127182 


AA248620 


Hs.166011 


catenin (cadherln-assoclated protein); d 


2.04 


116870 


H67146 


Hs.38564 


ESTs 


2.04 


115448 


AA284845 


Hs.165051 ESTs 


Z04 


127231 


AA434584 




zw52c03.r1 Soares.totaLfeius_Nb2HF8.9w Z04 


103457 


X99728 




H^apiens NDUFV3 gene, exon 3 


2.04 


134737 


U00802 


Hs.89434 


drebrin 1 


2.04 



117046 H89505 yu81«5l Soares fetal fiver spleen 1NFL 

to contains Alu repetitive element;, mR 2.04 

124579 N68345 Hs.127179 ESTs; Weakly slmBar to TERATOCARCINOMA- 

112132 R45970 Hs.236349 EST 2.04 

132281 AA133300 Hs.43803 leukocyte-associated Ig-ifte receptor 2 2.03 

103668 Z83741 Hs.248174 H2A histone family; member M 2.03 

113501 T89107 Hs.13262 ESTs 2.03 

125021 T70060 Hs.163918 ESTs 2.03 

115754 AA420998 Hs.178095 ESTs 2.03 

123405 AA521370 Hs.191708 ESTs Z03 

102054 U07695 Hs.155227 EphB4 Z03 

115627 AA401910 Hs.119175 ESTs; WeaWy similar to ZINC FINGER PROT ZG3 

129252 AA234663 Hs.109773 ESTs 2.03 

103417 X96849 H^piens 5 mRNA of PECAM-1 molecule 2.03 

133721 U11863 Hs.75741 amitoride binding protein 1 (amine codda Z03 

114176 Z39059 H$37267 ESTs; Weakly similar to tetraspan TM4SF 2.03 

123S66 C14068 Hs.21806 ESTs; Moderately sintiar to similar to N Z03 

134236 D45371 Hs.80485 adipose most abundant gene transcript 1 Z03 

116381 AA598614 Hs.65394 ESTs Z03 

103711 AA046737 Hs.102792 ESTs Z03 

109316 AA206914 Hs.86322 EST 2.03 

123793 AA620343 Hs.112858 ESTs Z03 

128462 M69238 Hs.166172 ajylhydrocajbonrecep^rnudeartrard Z03 

117690 N40467 Hs.93834 ESTs 2.03 

113301 T67452 Hs.13104 EST 2.03 

134563 AA173430 Hs.85335 Homo sapiens mRNA; cDNA DKFZp564D1462 (f 

108316 AA070160 zm69f4.s1 Stratagene neuroepimeOum (#9 Z03 

135239 AA454599 Hs. 19399 Homo sapiens chromosoma 19; fosmld 395542.03 

120342 AA207105 Hs.45068 Homo sapiens mRNA; cDNA DKFZp434l143 (fr 

103493 Y08976 Hs.234759 H^apiens mRNA for FEV protein 2.02 

114204 Z39259 Hs.26096 ESTs 2.02 

125425 H62307 Hs. 18575 ESTs; WeaJdy similar to K1AAQ246 \Hsap\ 2.02 

133027 AA402624 Hs.63236 synucteln; gamma (breast cancer^pedfic Z02 

131323 H54036 Hs.25619 death-associated protein kinase 3 Z02 

121515 AA412133 Hs.104696 ESTs Z02 

129780 AA291526 Hs. 124699 ESTs ZQ2 

131292 AF005039 Hs.200600 secretory earner membrane protein 3 2.02 

132973 AA035446 Hs.214361 ESTs 2.02 

103727 AA059415 Hs.6289 growth factor receptor-bound protein 2 Z02 

113174 T54659 Hs.9779 ESTs 2.02 

120964 AA398085 Hs.142390 ESTs 2.02 

134303 AA457242 Hs.8141 etoposide-induced mRNA Z02 

128118 T81623 HsJ21765 hypothetical protein of unknown funcfio 2.02 

121087 AA398751 Hs.97304 ESTs 2.02 

102806 U90306 Human iroquois-class homeodomain protein Z02 

103195 X70940 Hs2642 euteryotctranslafionelrjngat^ Z02 

126767 C17148 C17148 Clontech human aorta polyA+mRNA 2.02 

105179 AA189083 Hs^1974 ESTs; Moderately similar to mBOCT [NLmus Z02 

116797 H40486 yn87aQ8.s1 Soares adult brain N2b5HB55Y 

3' similar to contains Alu repefitlvee 2.02 

133268 AAD99404 Hs.69307 ESTs 2.02 

123951 AA621721 Hs.231130 EST 2.02 
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115463 AA286819 Hs.69485 ESTs; Weakly similar to simBar to other 2.02 

110603 H65776 Hs.222403 ESTs 2.02 

101234 L29277 Hs.142258 signal transducer and activator of trans 2.02 

121208 AA400470 Hs.97805 ESTs 202 

122598 AA453465 Hs.99329 ESTs 2.02 

110868 H84882 Hs.33791 ESTs; Weakly similar to K:CI cotransport 2.02 

117137 H96670 Hs.42221 ESTs 2.02 

119389 T88826 H&90973 ESTs 2.01 

102940 X13956 Hs.24998 Human 12S RNA induced by poJy(ri); poiyt 201 

100748 HG3517-HT3711 Antitrypsin, 5' End 2.01 

103012 X52638 Hs.739 6^hKphofrycto-2-kinase/fructose^^i 201 

132755 AA609201 Hs.182635 ESTs 201 

130842 H39589 Hs.20159 ESTs; Highly similar to CGI-92 protein [ 2.01 

133599 M64788 Hs.75151 RAP1; GTPase activating protein 1 2.01 

117250 N21081 Ks.15299 HMBA-inducfcle 2.01 

115124 AA256666 H&39156 ESTs 2.01 

128155 AA926843 Hs.143302 ESTs 201 

130574 AA379087 Hs.16178 apoptosis antagonizing transcription fac 201 

132601 R78838 Hs.54943 fracture callus 1 {raQ homotog 2.01 

117428 N27366 Hs.43933 EST 2.01 

1211Q8 AA399053 H&97529 EST 2.01 

130518 XB9550 Hs.159181 Rho GDP dissociation inhibitor (GDI) alp 201 

110606 H66049 Hs.19085 ESTs; Weakly simBar to putative p150(H 2.01 



120606 AA282956 Zt15h4.s1 NCICGAP.GCB1 Homo sapiens cON 

SW.CADRJdOUSE P3938 RET1NAL-CADHERIN PR 201 



130070 T47969 Hs.194660 cera!^onjsc^c^;rimironal3;juven 201 

130331 Z80783 Hs^39884 H2B histone ramify; member L 201 

109599 F02602 Hs.6749 ESTs 2.01 

131749 W78211 Hs31547 ESTs; Highly similar to NADH:ubiqulnone 201 

129463 AA376905 Hs.111742 ESTs; Weakly similar to 111! ALU SUBFAMI 201 

114680 AA235698 Hs.65862 ESTs 2.01 

114745 AA135523 Hs.139064 EST 2.01 

115637 M402727 Hs.76925 ESTs; Highly similar to R31167J2; parfia 201 

109043 AA159605 Hs.72580 ESTs 201 

128901 Z41411 Hs.107040 ESTs 201 

124427 N36812 Hs.178663 ESTs 2 

100673 HG3033-HT3194 Spfceosomal Protein Sap 62 2 

108436 AA078801 zm94a9.s1 Stratagene colon HT29 (#937221 2 

123764 AA610019 Hs.1 12654 ESTs 2 

129343 N70791 Hs.180060 ESTs 2 

122794 AA460254 Hs, 105043 EST 2 

128688 AA161469 Hs.1 03755 receptor-interacting serine-trirecnine Id 2 

115592 AA399543 Hs.48026 ESTs 2 

111693 R22007 Hs.23321 EST 2 

113353 T79186 Hs.14468 ESTs 2 
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Pkey: Unique Eos probeset identifier number 
ExAccru Exemplar Accession number, Genbank accession number 



UmgcticID* Umgcnc number 
Unigene Title: Unigene gene title 

Pkey ExAccn UnlGID Complete Title Ratio BS/Met 

106024 AA412059 Hs.111742 ESTs; Weakly similar to III! ALU SUBFAMI 0.17 

110930 N48603 Hs.14947 ESTs 0.18 

105772 AA347973 Hs.221132 ESTs 0.2 

133271 248633 Hs.6940 H^aptens mRNA tor retrotranspcscn 0.2 

107109 AA609943 Hs32793 ESTs 0.24 

109593 F02506 Hs.159591 thyroid hormone receptor Interact* 8 0.24 

123016 AA480103 Hs.111730 ESTs; Weakly similar to alternatively sp 0.25 

100739 HG3484-HT3678 Protein Kinase (Gb:M59287) 0.25 

130252 U92014 Hs.153527 Human done 121711 defective mariner tra 0.26 

105149 AA169253 Hs.8958 ESTs 0.26 

115412 AA283B04 Hs.193552 ESTs 0.27 

105952 AA405263 Hs.181400 ESTs 0.28 

10S596 AA456981 H&35349 ESTs 0.28 

120249 AA167567 Hs.133325 ESTs 0.28 

111676 R19414 Hs.166459 ESTs 0.29 

111161 N66767 Hs.124145 ESTs 059 

109364 AA215379 Hs.50418 ESTs 059 

132316 1)28831 Human protein irrimuno-reactive with anfi- 0.3 

104030 AA363131 Hs.222992 ESTs; Weakly similar to TRANSFORMATIONS 0.3 

109825 F13663 Hs.16798 ESTs 03 

111110 N63165 Hs.23618 ESTs 031 

135315 W90583 Hs.9853 ESTs 032 

104792 AA029288 Hs.29147 ESTs; H^hly similar to ZINC FINGER PROT 033 

123562 AA608893 Hs.190065 ESTs 033 

116079 AA455286 Hs£4982 ESTs; WeaktysimBarto III! ALU SUBFAMI 033 

110671 H67770 Hs.153800 ESTs 033 

108819 AA130986 Hs, 193253 ESTs 0.34 

115558 AA393806 Hs.1010 regulator of mitotic spindle assembly 1 034 

104781 AAQ26617 Hs.21610 ESTs; Highly similar to BAJ1 -associated 034 

111236 N69324 Hs. 12526 Homo sapiens clone 23903 mRNA sequence 0.34 

113341 T77866 Hs.189703 ESTs 035 

125371 AI084676 Hs 133266 ESTs; Moderately similar to Sqv-7-iike p 035 

115890 AA435853 Hs.44114 ESTs; Weakly similar to CGI-73 protein [ 035 

113571 T91116 Hs.15713 ESTs 0.35 

121683 AA417911 Hs.175663 ESTs 035 

105489 AA256157 Hs.24115 ESTs 035 

11632) AA490866 Hs.39429 ESTs 0.3S 

111917 R39882 Hs.21397 ESTs 036 

127568 T53722 ya91c05.r3 Stratagene placenta (#937225) 036 

123541 AA608794 Hs.112592 ESTs 0.36 

123131 AA487207 Hs.193272 ESTs 036 
125069 T86914 Hs.194485 ESTs 036 
114757 AA136725 Hs.161990 ESTs 037 
132778 AA446695 Hs3671 Homo sapiens clone 23926 mRNA sequence 037 

123132 AA487233 Hs.1 06711 eukaryotic translation in Wen factor 037 
134029 AA378597 Hs.143601 ESTs; Moderately similar to 67A9.b [D.me 037 
126956 AI434405 Hs.171957 triple functional domain (PTPRF interact 0.38 
108869 AA487563 Hs, 1888 13 ESTs 0.38 
107818 AA020957 Hs.167948 ESTs 0.38 
129974 K00829 Hs.1 99300 Human kpni repeat mma (cdna done pctWt 0.38 
129477 D49728 Hs.1 119 nuclear receptor subfamily 4; group A; m 0.36 
119369 T79020 Hs.245915 ESTs; Weakly similar to kinase-related p 0.39 
114021 W91995 Hs.16145 ESTs 0.39 
122024 AA431296 Hs.139433 EST 0.39 
130014 N50959 Hs.143102 am!neoxiaase;a)ppercotitaimng2(retj 0.39 
110163 H19326 Hs.22073 ESTs; Highly similar to J KAPPA-RECOMBIN 0.39 
104641 AA004652 Hs.18564 ESTs 0.39 
124777 R41933 Hs.140237 ESTs 0.39 
125382 AA713494 Hs.194660 cerold^iponjscinosis; neuronal 3; juven 0.4 
120406 AA234999 Hs.111279 ESTs; Weakly similar to unnamed protein 0.4 
132734 R23653 Hs.164250 ESTs 0.4 
117001 H84719 Hs.40721 EST 0.4 



120905 AA371602 Hs.182930 ESTs; Highly similar to PHOSPHAT1DYUNOS0.4 
(25488 AA355158 Hs.41181 Homo sapiens mRNA; cDNA DKFZp727C191 (fr 0.4 
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121989 AA430044 Ha.193784 Homo sapiens mRNA; cDNA DKFZp586K1922 (f 0.4 

127821 AA806616 H&209523 ESTs 0.4 

119830 W74700 Hs.53478 ESTs 0.41 

106292 AA435571 Hs.148560 ESTs 0.41 

102762 U 82303 Hs.123080 Homo sapiens unknown protein mRNA; parti 0.41 

113518 T89731 ye11f06^1 Statagene lung (#937210) H s 

to contains Alu repetitive element;cont 0.41 

100635 HG2724-HT2820 Oncogene Tls/Chop, Fusion Activated 0.41 

113319 T70356 Hs.193141 ESTs; Weakly similar to coding sequence 0.41 

121319 AA402935 Hs.194242 ESTs; Weakly similar to 1!!I ALU CLASS B 0.42 

111818 R34382 Hjl24779 ESTs 0.42 

104883 AA052959 Ks.177409 ESTs; Highly similar to dJ1 1 19D9.2 pisa 0.42 

129258 W95592 Hs.251946 ESTs; Moderately similar to POLYADENYLAT0.42 

130576 T86475 Hs.16193 Homo sapiens mRNA; cDNADKFZp586B211 (fr 0.43 

108354 AA443271 Hs.26764 K1AA0546 protein 0.43 

108841 AA132524 Hs.70614 ESTs 0.43 

113922 W80741 Hs.37890 ESTs 0.43 

120997 AA3982B5 Hs.97598 EST 0.43 

108158 AA054597 Hs.221935 ESTs 0.43 

124516 N58185 Hs.131830 ESTs 0.43 
114477 AA032013 Hs.144260 EST 0.43 

104290 C16652 Hs.107205 Homo sapiens mRNA; cDNA DKFZp434L2221 (f 0.43 

126700 AI318412 Hs.108258 actin binding protein; macrophln (microf 0.44 

110887 N38770 H&42B3 ESTs 0.44 

116141 AA460420 Hs.44949 ESTs 0.44 

110689 H93046 Hs.15571 ESTs 0.44 

115314 AA280583 Hs.256501 ESTs 0.44 

110904 N39453 Hs.27371 Homo sapiens mRNA; cDNA DKFZp566J123 (fr 0.44 

I09462 AA233375 Hs.78085 ESTs 0.44 

102284 U31449 Hs.11881 transmembrane 4 superfemDy member 4 0.44 

118654 N70582 Hs.49892 ESTs 0.44 

1 15334 AA281244 Hs.65300 ESTs 0.44 

113149 T51588 ESTs; Moderately similar to II!! ALU SUB 0.44 

113721 T97931 Hs.18190 EST 0.44 

111299 N73808 H&24936 ESTs 0.44 

I03778 AA094107 Hs.7187 ESTs; Weakly similar to similar to gtyco 0.44 

113204 T57865 Hs.10310 EST 0.44 

100315 D50857 Hs.82295 dedicator of cytokinesis 1 0.44 

115254 AA279024 Hs. 194437 ESTs 0.44 

125500 H46104 Hs.244624 ESTs 0.44 

117387 N260/I1 Hs.53810 ESTs 0.45 

135113 W42450 Hs^06833 ESTs 0.45 

124517 N58204 Hs.199945 ESTs 0.45 
1 20379 AA227849 Hs.238380 Human endogenous retroviral protease mRN 0.45 
119205 R91954 Hs.153699 ESTs 0/45 
I28266 T70341 Hs.131897 ESTs 0.45 
104106 AA422123 Hs.42457 ESTs 0.45 
115864 AA432080 Hs.81200 ESTs 0.45 
113771 W02695 Hs.18714 ESTs 0.45 

126515 AI124649 Hs.252708 Homo sapiens mRNA; cDNA DKFZp586O031 (fr 0.45 

I27823 AA524806 Hs.78869 transcription elongation factor A (Sll); 0.45 

116665 RM405 Hs.223654 EST 0.45 

I06355 AA443272 Hs.27836 ESTs 0.45 

I32693 AA621429 Hs£5075 KIAA0410 gene product 0.45 

I07388 W01587 Hs.173319 ESTs 0.45 

110688 H93021 Hs.162937 peptidylprolyi isomerase A (cyctophilin 0.46 

116893 H69569 Hs.191316 EST 0.46 

I05375 AA236542 Hs.9512 ESTs; Moderately similar to HQ ALU SUB 0.46 

115601 AA400277 Hs.48849 ESTs 0.46 

106896 AA489707 Hs^9896 ESTs; Weakly similar to profine-rich pro 0.46 

111770 R27975 Hs.187469 ESTs 0.46 

115663 AA405838 Hs.40507 ESTs 0.46 

131404 AA504744 Hs.26461 ESTs; Weakly similar to gc-rich sequence 0.46 

I08622 AA101828 Hs.189956 ESTs 0.46 

I28286 AI025771 Hs.144090 ESTs 0.46 

1 05760 AA338960 Hs.28170 ESTs 0.46 

I00020 AFFX control: BtoB-3 0.46 

105209 AA205072 Hs.227743 KIAA0980 protein 0.47 

111975 R41724 Hs.149566 ESTs 0.47 

114688 AA121403 Hs.144331 ESTs 0.47 

116994 H83918 Hs.40528 ESTs 0.47 

H8401 N64762 Hs.49053 EST 0.47 

110997 N52540 Hs.74316 desmoplaWn (DPI; DPII) 0.47 

123791 AA620331 Hs.245351 EST 0.47 

I09858 H02266 Hs.167451 ESTs 0.47 

M5470 AA287122 Hs.48391 ESTs 0.47 
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130606 AA402109 Hs.16593 ESTs 0.47 

116067 AA454827 Hs.124623 ESTs 0.47 

125881 AA775807 Hs.150741 2';3 , -cycEc nucleotide 3* phosphotfestB 0.47 

124028 F04112 Hs.177178 ESTs 0.47 

108395 AA155574 Hs.172702 ESTs 0.47 

125102 T95105 Hs.173772 ESTs 0.47 

110421 H48462 Hs.36093 ESTs; Weakfy similar to reverse transcri 0.47 

105658 AA282914 Hs.10176 ESTs 0.47 

129046 .AA195678 Hs.108258 actin binding protein; macrophm (microf 0.47 

113639 T95128 Hs.17529 ESTs 0.48 

132575 AA045365 Hs.5188 ESTs; Weakly similar to 60S RIBOSOMAL PR0.48 

132592 AA129390 Hs.5285 ESTs 0.48 

107619 AA004955 Hs.60015 ESTs 0.48 

118664 N70907 Hs.230619 EST 0.48 

127612 AA917801 Hs.116076 ESTs 0.48 

112319 R55S15 Hs.26432 ESTs; Weakly stmflar to finger protein H 0.48 

113635 T95087 Hs.15543 ESTs 0.48 

119344 T62969 Hs.193348 ESTs 0.48 

121080 AA398720 Hs.177953 ESTs 0.48 

133686 X83378 Hs.211614 chloride channel 6 0.48 

130395 R54534 Hs.87889 helcase-moi 0.49 

127530 AA563806 Hs, 145728 ESTs 0.49 

132971 AA033951 Hs.61700 ESTs 0.49 

127132 AA721156 Hs.190440 ESTs 0.49 

129980 T72661 Hs.13969 ESTs 0.49 

105323 AA234112 HsJ29075 ESTs 0.49 

114439 AA018937 Hs.128629 ESTs 0.49 

107632 AA007242 Hs#>179 EST 0.49 

130952 AB002296 Hs.21560 Human mRNA for KIAA0298 gene; complete c0.49 

127595 AA927308 Hs.130464 ESTs 0.49 

124276 H83465 Hs.221934 ESTs 0.49 

125935 H30721 H&30172 ESTs 0.49 

131275 U45974 Hs.25156 Human phosphaWytaftol (4;5) bisphos 0.49 

131186 C20833 Hs^4129 ESTs 0.49 

125505 AI127843 Hs.155071 ESTs 0.5 

113327 T71776 Hs.12097 ESTs 0.5 

104709 AA017146 Hs.34579 ESTs; Moderately similar to HI) ALU SUB 0.5 

115772 AA423972 Hs.8154 ESTs 0.5 

118296 N63150 Hs.48723 ESTs 0.5 

131453 C2Q596 Hs.26985 K1AA0457 protein 0.5 

104734 AA019528 Hs.32677 ESTs 0.5 

119358 T70550 Hs.193651 ESTs; Weakly similar to alternatively sp 0.5 
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Table 19: B survivor vs Mets - Up in B survivor 



Pkey: Unique Eos probeset identifier number 

ExAccm Exemplar Accession number, Genbank accession number 

Uw^cucID* Unt^cnc number 

Untgeoe Tide: Uoigcuc gene title 



Pkey ExAccn UniGJD Complete Trtle Ratio BS/Met 

333601 CH2LFGENESi13.4 5.5 

325300 CH.11_hsgP66908 4.67 

333642 CH22_FGENES.23U 4.64 

333591 CHZLFGENES-208 4 4.46 

332859 CH22J=GENES.27J 4.39 

304013 AW518573 Hs.156110 Immunoglobulin kappa variable 1M 4.23 

333791 CH22_FGENES.274J0 4.18 

327641 CH.04_hsgl)5867890 4.03 

321172 H49160 Hs.133472 ESTs 3.9 

334125 CH22_FGENES.334_4 3.88 

333646 CH2LFGENES.234J 3.88 
326554 CH.19Jisgi|5867308 3.84 
333650 CH2*_FGENES338_3 3.82 

333647 CH2£/GENES.235_2 3.79 
333626 CH22_FGENES.224_2 3.68 
314671 AW236550 Hs.131914 ESTs 3.68 
310847 AJ420523 Hs.161282 ESTs 3.67 
333657 CH22JGENES.241_J2 3.65 
338522 CH2ZJAftAC005500.GENSCAN.395V36 3.64 
329464 CH.YJisgiI6456788 3.6 
328868 CH.07„hsgl|6381930 3.6 
333637 CH22_FGENES.229J 359 
329737 CH.14_p2gP65779 ZJS 
317828 AI791749 Hs.128896 ESTs 3.44 
330520 M96995 Hs.6289 growth factor receptor-bound protein 2 3.44 
339271 CH22JA354I12.GENSCAN.11-2 3.44 
314927 AI735482 Hs.159580 ESTs 3.42 
334782 CH22LFGENES.432.7 3.42 
313138 AW138842 Hs.196669 ESTs 3.4 
332650 H51596 H&5541 ATPase; (^transporting; uUqutous 3.38 
338648 CH22LEM^C005500.GENSCAN.46O6 3.38 
325677 CH.14_hsgI|5867017 3.34 
312639 H50648 Hs.213221 ESTs; Weakly similar to till ALU SUBFAM1 3.33 
326545 CH.19Jisgi}5867307 3.32 
318364 R44616 Hs.138280 ESTs; Moderately similar to (HI ALU SUB 3.3 
308385 AJ625428 EST singleton (not in UniGene) with exon 3.26 
328569 CH.07.hsgl|6004480 3.26 
328582 CH.07Jisgi|6006033 3.24 
310975 AW92857 Hs.170940 ESTs 3.24 
336883 CH2Z/GENES.322-2 3.21 
324425 AW236939 Hs.172154 ESTs 3.2 
337870 CH22JWtAC005500.GENSCAN.48^ 3.19 
306624 AI001043 EST singleton (not in UniGene) with exon 3.17 
319091 Z45264 EST cluster (not In UniGene) 3.16 
335247 CH22JGENES.516J 3.12 
324945 AA088768 EST duster (not in UniGene) 3.1 
319468 R06504 EST cluster (not in UnKBene) 3.09 
301635 A1590720 Hs.192662 ESTs; WeaMy similar to ZINC FINGER PROT 3.08 
321215 AW378128 Hs.120243 ESTs; Weakly slmfiar to CGW6 protein [ 3.04 
328507 CH.07Jisgl|5868473 3.03 
330266 CK05j>2giI6671885 3.02 
326249 CH.17Jisgi|5867263 3.01 
325649 CH.14Jisgi|6588011 £99 
304575 AA496437 EST singleton (not in UniGene) with exon 2.98 
304559 AA468050 EST singleton (not In UniGene) with exon 2.97 
338412 CH22_EMAOT05500.GENSCAN.341-25 2.96 
308707 AI769997 EST singleton (not In UniGene) with exon 2.95 
313027 N34307 Hs.184003 ESTs; Weakly similar to lill ALU SUBFAMI 2.95 
306590 AI000246 EST singleton (not In UniGene) with exon Z95 
306183 AA922622 EST singleton (not tn UniGene) with exon Z94 
308611 AI735372 Hs.203820 EST; Moderately similar to TRANSLATIONAL 2M 
332454 T63265 Hs.11186 ESTs; Weakly similar to trarsforrnatorH 234 
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330081 




CH.17_p2gi|S72126l 


2.94 


317671 


AW138139 Hs.244598 ESTs 


193 


338705 




CH22JM^C005500.GENSCAN.480-4 


233 


333737 




CH22_FGENES.261J 


23 


337756 




CH2^ElWtAC000097.GENSCAN.109^ 


2,9 


333572 




CH22_FGENES.189J 


189 


335349 




CH2U=GENES539J 


189 


328835 




CH.07_hsgiI5868339 


2.89 


319886 


AA984628 


EST duster (not In UnlGene) 


2.88 


311247 


AI655313 


Hs.197692 ESTs 


187 


303887 


R72672 


Hs.193484 ESTs; Weakry simnar to Similarity wim 


2.86 


337564 




CH22_C65E1.GENSCAN.1-7 


2.85 


333225 




CH25LFGENES.107J 


2.84 


314938 


AA515635 


EST cluster (not in UniGene) 


183 


305803 


AA846052 


EST singleton (not in UniGene) with exon 


2.83 


305264 


AA679505 


EST singleton (not in UniGene) with exon 


2.83 


332646 


AA386264 H&5337 isodtrate dehydrogenase 2 (NADP+); mito 


2.81 


338508 




CH22_EfvLAC005500.GENSCAN^1-1 


2.81 


308097 


AI475411 


EST singleton (not En UniGene) with exon 


181 


301130 


AW194167 Hs.149418 ESTs; Weakly similar to saflvaiy proline 


2.8 


325571 




CH.1*_hsgI|6552439 


2.8 


307054 


AI148181 


Hs.176835 EST 


2.8 


337456 




CH22_FGENES.777-2 


2.79 


317870 


AI797066 


Hs.201995 ESTs 


2.79 


303171 


AA065003 Hs.64179 hypothetical protein 


2.78 


333717 




CH22_FGENES.253_3 


176 


303778 


AW505368 


EST cluster (not in UniGene) with exon h 


2.76 


304918 


AA602697 


EST singleton (not in UniGene) with exon 


2.76 


319373 


R00371 


EST cluster (not in UniGene) 


2.75 


336072 




CH22_FGENES.685_4 


2.74 


306023 


AA897764 


EST singleton (not in UniGene) with exon 


174 


336127 




CH22_FGENES.701.15 


174 


337355 




CH22_FGENES.728-1 


173 


337685 




CH22_EM^C005500.GENSCAN.54^ 


173 


308506 


AI686791 


Hs.1 19598 rtbosomal protein L3 


2.73 


300629 


AA152119 Hs.155101 ATP synthase; H+ transporting; mitochond 


2.73 


333043 




CH22_FGENES.70J 


2.72 


327736 




CH.05Jisgil5867940 


172 


333007 




CH22_FGENES.60_4 


2.72 


321966 


AL122111 


EST cluster (not in UniGene) 


2.72 


323179 


AW452576 Hs.156875 ESTs 


2.72 


332459 


AA609625 Hs.1 12933 Homo sapiens Tax Interaction protein 40 


2.71 


326224 




CH.17Jisgi|5867230 


171 


329114 




CHJO*gi[5868650 


2.7 


333577 




CH2U=GENES.198_2 


2.69 


300413 


AW090347 Hs»>43443 ESTs 


2.67 


304055 


R07994 


EST singleton (not in UniGene) with exon 


2.67 


301013 


AI935304 


Hs.125262 . DKFZP586G1624 protein 


2.67 


337848 




CH22_EMA0005500.GENSCAN.33-1 


2.66 


327946 




CH.06_hsgiJ5868206 


2.66 


306300 


AA937573 


EST singleton (not in UniGene) with exon 


166 


331071 


R01646 


H&200538 ESTs 


2.65 


304841 


AA587541 


EST singleton (not in UniGene) with exon 


2.65 


301321 


A1860987 


Hs.189097 ESTs 


2.65 


311280 


AI767957 


Hs.197737 ESTs; Weakly simSar to Y38A8.1 gene pro 


2.65 


338843 




CH22_DJ246D7.GENSCAN.8-1 


2.64 


335720 




CH22_FGENES599_23 


164 


333670 




CH22J r GENES.245_4 


2.64 


313588 


AI803591 


Hs.209667 ESTs 


2.64 


335750 




CH22_FGENES.602_4 


2.63 


333240 




CH22J=GENES.111_4 


2.63 


332721 


R70212 


Hs,79630 CD79A antigen {immunogtobuiin-assoctated 


2.62 


338747 




CH22_EMAC005500.GENSCAN.511-1 


2.62 


303582 


AA377444 


EST duster (not in UniGene) wr&iexonh 


162 


336898 




CH22.FGENES.330-1 


2.62 


325835 




CH.16Jsgi|6552452 


162 


301660 


F13112 


EST cluster (not in UniGene) with exon h 


161 


335968 




CH22_FGENES.652J 


161 


336705 




CH22_FGENES.63-2 


2.6 


309815 


AW292760 


EST singleton {not In UniGene) with exon 


2.6 


339220 




CH22_FF1 13D11.GENSCAN.6-15 


16 


308582 


A1709056 


EST singleton (not In UniGene) with exon 


16 


334260 




CH22_FGENES.367_8 


2.6 


309963 


AW449073 


EST singleton (not in UniGene) with exon 


16 


300178 
335690 


A1282665 


Hs.166969 ESTs 


159 




CH22LFGENES.596J 


159 
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308127 AM92187 EST singleton (not In UniGene) with axon 2.59 

337835 CH2^EIVtAC005500.GENSCAN.22-4 2.58 

333251 CH^GBIES.Iiej 2.58 

330319 CH.Q8j>2gl|5932415 2.58 

314490 AI758114 Hs.197032 EST* 2.57 

305934 AA878815 Hs.75442 albumin 2i7 

329665 CH.14j2gl|6272129 25t 

328558 CH.07_hsgi|5B68489 2.57 

336094 CH2Z.FGENES.691J 2.57 

307899 AB8G270 EST singleton (not In UniGene) with exon 257 

339312 CH22JA354I12.GENSCAN.22-10 2.57 

336442 CH22.FGENES.827J 2.57 

317894 R60848 EST duster (not in UniGene) 2.56 

330435 HG2689-HT2785 Mudn 5b , Tracheobronchia! (GteX74955) 2.56 

327304 CH.01Jisgi|5867494 2.56 

308859 AIS30787 EST singleton (not in UniGene) with exon 2.55 

302224 AI951549 Hs.161166 K1AA1094 protein Z55 

304324 AA137045 EST singleton (not h UniGene) with exon 2.54 

338090 CH22LEMAC005500.GENSCAN.17M 2.53 

334797 CH22.FGENES.434.5 Z52 

303535 ALB43430 EST duster (not in UniGene) with exon h 2,52 

339037 CH22J)A59H1B.GENSCAN.26* 2.52 

327B46 CH.05_hsgij6531962 2.52 

325271 CH.l1_hsgi|5866901 . 252 

312385 R42885 Hs.215555 ESTs 2.51 

302816 AJ733918 Hs.204112 ESTs; Weakly similar to alternatively sp 2.51 

316941 AW449871 Hs.124591 ESTs Z5 

300184 AI285912 Hs.254515 ESTs 2.5 

333762 CH22.FGENES270J 2.5 

317028 AA962623 Hs.189144 ESTs; Weakly similar to RENAL SQDIUM-DEP 2.5 

326266 CH.17_hsgil5867264 ' 2.49 

328005 CH.16_hsgQ5867112 2.49 

301971 AJ003125 Hs. 120330 a dislrrtegrfn-S<e and meblioprotease ( 2.48 

326539 CH.19_hsgp67307 2.48 

338896 CH22JU32I10.GENSCAN.94 2.48 • 

306773 AJ040750 EST singleton (not in UniGene) with exon 2.47 

336279 CH2LFGENES.763.3 2.47 

321017 AUJ50345 Hs.227637 hypothetical protein 2.47 

306090 AA908609 EST singleton (not in UniGene) with exon 2.47 

333216 CH22_FGEr£S.104J 2.46 

338^93 CH22_EM:AC005500.GENSCAN.435-2 2.46 

333587 CH22JGENES505J 2.46 

300395 AW295466 Hs232051 ESTs 2.45 

304693 AA554263 EST singleton (not in UniGene) with exon 2.45 

338934 CH22_DJ32l10.GENSCAN.18-2 2.45 

325751 CH.14_hsgij6682474 2.45 

334137 CH22J=GENES337_1 2.45 

333581 CH22J=GENES.2O0 1 2.45 

302083 AI422807 Hs.134012 Ciliated factor " 2.44 

307318 AI208577 EST singleton (not in UniGene) with exon 2.44 

302181 AW374284 Hs.157732 Homo sapiens chromosome 19; cosmid R2689 Z44 

337425 CH2Z.FGENES.761-1 2.44 

336227 CH22J : GENES.730_2 2.44 

314657 AI015953 Hs.125265 ESTs 2.44 

338529 CH22_EMAC005500.GENSCAN^98-10 2.44 

333680 CH22_FGBJES.247_7 2.43 

324834 AJ003258 Hs.250891 ESTs 2.43 

305093 AA642917 EST singleton (not in UniGene) with exon 2.43 

335787 CH22.FGENES.611J 2.43 

311704 AI655206 Hs.121512 ESTs; Moderately similar to kmestn like 2.43 

329382 CHXJis g!J5868868 2.42 

334785 CH22_FGENES.432_10 2.42 

330130 CU21jj2gt|6002196 2.42 

327206 CH.01Jjsgi|5867447 2.41 

319235 F11330 Hs.177633 ESTs Z41 

334691 CH2Z.FGEhES.420 4 2.4 

327610 CH.04_hsgi|5867868 2.4 

327646 CH.04_hsgi|5867894 2.4 

337093 CH2Z.FGENES.465-18 2.4 

335081 CH22.FGENES.488.4 2.4 

333576 CH22.FGENES.193J 2.4 

337604 CH2Z_C2QH12.GENSCAN.16-5 2.4 

329879 CH.15_p2gi]6466518 2.4 

328444 CH.07Jisgij586B420 2.39 

335700 CH2Z.FGENES.598J Z39 

331255 Z41009 Hs.21446 ESTs; Weakly similar to HYPOTHETICAL PRO 2.39 
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327927 




CH.06_hsgi|5868173 


410 


334354 




CH22.FGENES377J 


41Q 


308517 AI689279 




EST singleton {not In UnlGene) with exon 


9 1Q 


303669 AW499648 Hs.233750 coplneV 


4 10 


333648 




CH2LFGBJES237J 


O 1ft 


318318 AI653893 


Hs.174463 ESTs; Weakly similar to alpha3b sub unit 


4 1ft 


338336 




CH2LEMAC005500.6ENSCANJKW 


2.00 


304125 H40976 




EST singleton (not in UnlGene) with exon 


4 1ft 

Z.Jo 


304983 AA617786 




EST singleton (not In UnlGene) with exon 


4 4Q 

2.00 


334935 




CH22LFGENES.464J 


4 4ft 


314326 AW170057 Hs.133179 ESTs 


4 1ft 


330406 D49490 


Hs.76901 


for protein disulfide Isomerase-related 


4 4Q 
-.00 


307646 A1302236 




EST singleton (not in UnlGene) with exon 


4 1ft 
Z.00 


338911 




CH22_DJ32I10.GENSCAN.114 


4 1Q 
_.O0 


319952 T79532 


Hs225725 ESTs; Moderately similar to C6M01 prat 


4 17 

23/ 


336878 




CH22_FGENES.318^ 


0 47 

2.37 


338140 




CH22_EM:AC005500.GENSCANm6 


1 17 


300564 A1383878 


Hs.225588 ESTs 


n mem 

2.37 


304635 AA523976 




EST sfogteton (not In UnlGene) with exon 


4 17 


334091 




CH22J=GENES327_47 


2.37 


336328 




CH2a.FGENES.812„7 


4 47 

2.37 


325310 




CH.11Jisgq5866864 


0 47 

2JSI 


338043 




CH2^EMtAC005500.GENSCAN.153>2 


4 47 

Z37 


307090 AI161024 




EST singleton (not in UnlGene) with exon 


4 47 

237 


335768 




CH22J=GENES.607_2 


2,37 


334969 




CH22_FGENES.466_2 


237 


333640 




CH2£FGENES.230_2 


0 Oft 

2.0b 


330002 




CH.16_p2gi|6623963 


236 


338829 




CH22JW246D7.GENSCAN.5-12 


2.36 


323808 AW250114 




EST duster (not in UnlGene) 


2-36 


327755 




CH.05.hsgi|5867955 


2.35 


306426 AA975039 




EST singleton (not in UnlGene) with exon 


2.35 


336481 




CH2a.FGENES.830J 


235 


335163 




CH22_FGENES.502_7 


235 


322012 AL137357 




EST duster (not in UniGene) 


235 


337345 




CH22_FGENES.723-1 


235 


334625 




CH22JK3ENES.414J 


4 4C 


320957 A1878933 




EST duster (not In UnlGene) 


235 


334915 




CH22_FGENES.457_4 


4 4C 

235 


336295 




CH22J r GENES.787_1 


4 QC 

2.35 


321556 N46402 


Hs.14570 


ESTs 


4 4K 


338491 




CH22_B^AC005500.GENSCAN.3S5-2 


4 4C 

235 


335517 




CH22J=GENES.57U4 


4 44 

234 


330639 X90872 


Hs.75854 


SULT1C sulfotransferase 


4 44 

234 


310383 AJ2G31Q2 


Hs.145596 ESTs 


4 IX 

234 


331526 N499S7 


Hs.46624 


ESTs 


4 44 

234 


334396 




CH22.FGENES38U 


4 44 

234 


332993 




CH22.FGENES.57J 


O 44 

234 


327487 




CH.GLhsglj5867785 


4 44 

234 


335920 




CH22JH5ENES.636J6 


233 


336463 




CH2^FGENES.829J2 


233 


319000 Z44318 




EST duster (not in UnlGene) 


233 


332992 




CH2LFGENES.57J 


233 


332920 




CH22.FGENES.37J 


233 


337590 , 




CH2i_C20H12.GENSCAN.6-5 


233 


327059 




Ca21_hsgiI5531965 


233 


334399 




CH2?J : GENES382J 


233 


300982 AA837754 




EST duster (not in UnlGene) wflh exon h 


232 


327430 




Ca0Lhsgi|5867754 


232 


326808 




CH.20_hsgP82504 


232 


309324 AW015373 




EST singleton (not In UniGene) with exon 


232 


329779 




CH.14_p2giJ6002090 


2.32 


330492 M25809 


Hs.64173 


ATPase; H+ transporting; lysosomal (vacu 


231 


330080 




CH.19_p2gpi5314 


2.31 


334342 




CH2? W FGENES.375_20 


231 


336306 




CH22 W FGENES.793_5 


231 


336400 




CH22.FGENES.823J5 


231 


323735 AA323714 




EST duster (not in UnlGene) 


231 


334496 




CH22.FGENES397J2 


231 


336075 




CH22.FGENES.687 1 


231 


335566 




CH2i.FGENES.580J 


231 


337657 




CH22 w EM:ACQ00097.GENSCAN32-9 


231 


327816 




CH.05Jisgl[5867968 


23 


308465 AI672480 




EST singleton (not in UniGene) with exon 


23 


330112 




CH.19_p2gl|6015238 


23 


304465 AA421948 




EST singleton (not In UniGene) with exon 


23 



172 



WO 02/068677 



308449 


AI660854 


EST singleton (not In UniGene) with exon 


2.3 


328171 




CH.06_hsgl|5868071 


Z3 


328271 




CH.06Lhs #552415 


2.3 


328803 




CH.07_hsgij6004475 


2.3 


330083 




CH.19_p2gij8165044 


229 


312281 


H11643 


EST cluster (not in UniGene) 


229 


328974 




CH.09 hsgi]5868520 


129 


333859 




CH22_"FGENES.2S0_18 


229 


326253 




CH.17_h3 #867263 


229 


325703 




CH.14Jis #867028 


Z29 


338925 




CH22_DJ32l10.GENSCAN.14-3 


2.29 


328552 




CH.07_hsgl|5868489 


£29 


337244 




CH22_FGENES.64fc8 


229 


314770 


AI732722 


Hs.187694 ESTs 


229 


324560 


AW502208 


EST duster (not In UniGene) 


229 


310803 


AW376860 Hs.156398 ESTs 


229 


337383 




CH22J r GENES.733-2 


229 


308015 


AI440174 


Hs228907 EST; WeakJy similar to GUANINE NUCLEOTID 


309208 


AI961962 


EST singleton (not in UniGene) with exon 


228 


337455 




CH22.FGENES.777-1 


228 


327605 




CH.03 hs #004463 


228 


301611 


W22172 


Hs.59038 ESTs 


2.28 


317222 


AI206964 


Hs.130051 ESTs 


2.28 


338278 




CH22 EMACQ05500.GENSCAN29W 


228 


337291 




CH2^FGENES.673.2 


227 


337913 




CH22.EMAC005500.GENSCAN.59-10 


227 


308406 


AA971973 


EST singleton (not in UniGene) with exon 


227 


332947 




CH22.FGENES.47J0 


227 


321763 


W01148 


EST duster (not in UniGene) 


227 


304424 


AA293494 


EST singleton (not in UniGene) with exon 


Z27 


303782 


T64737 


EST cluster (not in UniGene) with exon h 


227 


326943 




CH^Ujsgp04446 


227 


324977 


R14439 


Hs209194 ESTs 


227 


325480 




CH.12_hsgi|5865957 


227 


327743 




CH.05 hsgi|5867944 


227 


333221 




CH22.FGENES.105J 


226 


336498 




CH2^_FGENES.833J 


226 


321583 


H84421 


EST cluster (not In UniGene) 


Z26 


334191 




CH22_FGENES.352_6 


Z26 


327089 




CH21Jtsgl|6531965 


226 


310001 


F18939 


Hs.153827 ESTs 


226 


304056 


R0B577 


EST singleton (not in UniGene) with exon 


225 


324700 


AW504745 


Hs.103913 ESTs; Moderately similar to 111! ALU SUB 


225 


330637 


X86371 


Hs.95659 lethal giant larvae (Drosophifa) homolog 


225 


307642 


AI302103 


EST singleton (not in UniGene) with exon 


225 


336985 




CH22_FGENES.402-6 


225 


334425 




CH22_FGENES.384_13 


225 


321216 


AI078042 


Hs.126691 ESTs 


225 


315785 


AW205946 Hs.150319 ESTs 


225 


305809 


AA853998 Hs.124580 EST 


225 


331334 


AA284858 Hs.89134 ESTs 


225 


317131 


AI991125 


Hs.189109 ESTs 


225 


334216 




CH22J : GENES.358J 


224 


330330 




CR08j>2gil5670267 


224 


326923 




CH21Jisgi|6456782 


224 


333774 




CH22_FGENES272_5 


224 


324311 


AA443061 


Hs.202520 ESTs 


124 


338551 




CH2^BMAC005500.GENSCAN.413^ 


Z24 


306716 


AI024916 


Hs 251354 ESTs 


224 


337689 




CH22_EMACQ00097.GENSCAN.77-5 


224 


300079 


AI192520 


Hs.147178 EST 


223 


334617 




CH22_FGENES.411J6 


223 


336890 




CH22_FGENES.326-10 


223 


334495 




CH22.FGENES.397J0 


223 


327301 




CH.01JW #867493 


Z23 


337856 




CH22L.EMAC005500.GENSCAN.41-3 


Z23 


307072 


AI150424 


Hs.146817 EST 


223 


330515 


M85247 


Usapiens dopamine D1A receptor gene, co 


222 


325943 




CH.16JsgP67138 


222 


338947 




CH22JXJ32n0.GENSCAN21-4 


222 


317465 


AW197361 Hs.131360 ESTs 


222 


332458 


M33493 


Hs.1B4504 tryptase; alpha 


222 


333195 




CH22LFGENES.98J7 


222 


304837 


AA567139 


EST singleton (not in UniGene) with exon 


222 


307602 


A1288843 


Hs.231239 EST 


222 


337078 




CH22_FGENES.457-1 


Z22 



173 



WO 02/068677 



335862 






CH2ajGENES.629_7 


2.22 


301979 


L28168 


Hs.121495 


potassium voltage-gated channel; Isk-rel 


2.22 


335668 






CH22_FGENES.590J9 


2.22 


305068 


AA639616 




EST singleton (not in UniGene) with axon 


2.21 


329034 






CHX.hsgi|5868561 


2.21 


318403 


AI131241 


Hs.143234 


ESTs 


2.21 


328058 






CH.06Jisgi|5902482 


2.21 


335513 






CH22_FGENES.571_28 


2.21 


330803 


AA004699 Hs.150580 


putative translation initiation factor 


2.21 


331427 


H54764 


Hs.237339 


EST 


2.21 


338973 






CH22JW32I10.GENSCAN.27-6 


2.2 


336723 






CH22.FGENES.85-3 


2.2 


327290 






CH.01Jtsgt{5867483 


22 


337240 






CH22_FGENES.644-1 


22 


306201 


AA926818 




EST singleton (not in UniGene) with exon 


2.2 


303659 


AA868464 Hs.126263 


ESTs; Highly similar to FIBRHLARIN IrU 


2.2 


334517 






CH22_FGENES.399_7 


22 


334169 






CH2£JGENES.352_4 


22 


335199 






CH22JGENES.508J 


22 


333705 






CH22_FGENES.250J9 


22 


305794 


AA845324 




EST singleton (not tn UniGene) with exon 


22 


303273 


AA316069 




EST cluster (not in UniGene) with exon h 


22 


313384 


W85694 


Hs.1 18335 


ESTs 


22 


329158 






CHX_hsgi|5868687 


22 


337551 






CH22_FGENES,847-8 


22 


328792 






CH.07_hsgi|586B309 


22 


303737 


AW502711 




EST duster (not in UniGene) with exon h 


2.19 


324529 


AW502466 




EST cluster (not in UniGene) 


2.19 


323103 


Z45529 


Hs.92030 


ESTs 


2.19 


333773 






CH22_FGENESi72.4 


2.19 


337905 






CH22_EMAC005500.GENSCAN56-19 


2.19 


327129 






CH.21Jisgil6531976 


2.19 


305710 


AA826544 




EST singleton (not in UniGene) with exon 


2.19 


335595 






CH2^FGENES.581_34 


2.19 


323646 


AA310926 Hs.154412 


ESTs 


2.19 


328368 






CH.07_hsgl|5868388 


119 


325802 






CH.14.hsgiI6552451 


2,19 


337167 






CH22J : GENES.562-27 


2.19 


305059 


AA635756 




EST singleton (not in UniGene) with exon 


2.18 


321445 


AW245524 Hs.12159D 


ESTs; Weakly similar to ZINC FINGER PROT 118 


332790 






CH22 FGENES.2.4 


2.18 


336750 






CH2a.FGENES.1284 


2.18 


310999 


A1520706 


Hs.171012 


ESTs 


2.18 


329798 






CR14_p2gH6523160 


2.18 


327012 






CH2Lhsgi{5867664 


2.18 


304599 


AA506638 




EST singleton (not In UniGene) with exon 


2.18 


335351 






CH22J : GENES.539_4 


118 


310661 


AI354717 


Hs.223908 


ESTs 


118 


332791 






CH2a.FGENES.3J 


2.17 


333022 






CH22.FGENES.65J 


2.17 


310502 


A1458973 


Hs.170422 


ESTs 


2.17 


324963 


AA853440 




EST cluster (not in UniGene) 


2.17 


325275 






CH.11Jisgi|5866902 


2.17 


328338 






CH.07Jisgi[5868377 


2.17 


333063 






CH2a.FGENES.75J 


2.17 


308895 


AIB58423 




EST singleton (not in UniGene) with exon 


2.17 


338685 






CH2a.EMAC005500.GENSCAN.4724 


2.16 


325655 






CH.14_hsgl|5867007 


116 


33242) 


H49570 


Hs.108074 


ESTs; Weakly simdarto CEREBELUN 1 PRE 


■2.16 


337216 






CH22_FGENES.613-10 


116 


335660 






CH2a_FGENES.590J1 


2.16 


337145 






CH2^FGENES542-2 


2.16 


335753 






CH2LFGENES.6Q4J 


2.16 


301766 


R02224 




EST cluster (not in UniGene) with exon h 


2.16 


303442 


AI953998 


H9.152510 


ESTs; Weakly similar to L-SERiNE DEHYDRA 


311009 


AI949701 


Hs.210589 


ESTs 


2.16 


307093 


AI167606 




EST singleton (not in UniGene) with exon 


2.16 


300262 


AI874402 


Hs.170810 


ESTs 


2.16 


337989 






CH22_EMAC005500.GENSCAN.H2-7 


2.16 


326263 






CH.17_hsgl|5867264 


2.16 


319402 


W21298 




EST cluster (not in UniGene) 


2.16 


321010 


Y17456 


Hs.227150 


Homo sapiens LSFR2 gene; last exon 


116 


301706 


AI929150 


Hs.241496 


ESTs 


116 


307412 


AI241753 


Hs.241507 


ribosomal protein S6 


116 


335662 






CH2?_FGENES^90J3 


2.15 


332480 


AA092932 Hs.12570 


tubufln-specific chaperoned 


115 



174 
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PCT/US02/06001 



329273 CHXJvs gi|5858762 Z15 

339383 CH22_BA232E17.GENSCAN.3-20 2.15 

332795 CH22J=GENES5J 2.15 

335227 CH2£/GENES.513J3 2.15 

326925 CH.21Jisgi|6456782 2.15 

332403 AA424199 Hs.105529 ESTs; Highly similar to CGW5 protein [ 2.15 

317786 AI859605 Hs.155686 ESTs 215 

326582 CH.19_hsgIl5867318 2.15 

336494 CH^GENES^H 2.15 

329656 CH.14j)2gl|6448516 2.15 

307581 AI284415 EST singleton (not in UntGene) with exon 2.15 

335670 CH2£.FGENES.591_2 2.14 

332452 AA040369 Hs.11170 SYT interacting protein 214 

309387 AW079943 Hs.156110 Immunoglobulin kappa variable 1D-8 2.14 

308427 AI652677 Hs.195055 EST 2.14 

322027 NM.004551 EST duster {not in UniGene) 2.14 

301693 Z45023 EST duster (not in UniGene) with exon h 2.14 

334308 CH22_FGENES.373_11 2.14 

301131 AW134518 Hs.131807 ESTs 213 

338495 CH22_EfvtACC05500.GENSCAN.387-1 Z13 

329600 CH.10_p2gi|3962481 2.13 

307980 AI431696 EST singleton (not in UniGene) with exon 2.13 

337260 CH22.FGENES.652-15 213 

304655 AA527887 EST singleton (not in UniGene) with exon 2.13 

303141 AF195951 EST duster (not in UntGene) with exon h 2.13 

327957 CH.06Jsgi|5868210 2.13 

334317 CH23JFGENES374J 2.13 

302870 AF011407 EST duster (not in UniGene) with exon h 2.13 

333808 CH22_FGENES57BJ 2.13 

329947 CH.16_p2gJ554O101 213 

309602 AW182523 EST singleton (not In UniGene) with exon 2.13 

322790 AI700273 Hs.122162 ESTs; Weakly similar to WAA0557 protein 2.13 

337706 CH22_EWAC000097.GENSCAN.87-H 2.13 

306894 AI092731 EST singleton (not in UniGene) with exon 213 

325530 CH.12_hsgij6525289 212 

321087 AL110227 Hs_>41533 Homo sapiens mRNA; cDMA DKFZp434J194 (fr 2.12 

309853 AW298169 Hs.57553 tousied-Oke kinase 2 212 

326822 CR20_hs #117831 212 

328776 CH.07_hsgiI5868309 212 

335112 CH22.FGENES.494J0 212 

334564 CH22_FGENES.405_4 2.12 

333455 CH22.FGENES.157J 212 

317395 R55044 Hs.124130 ESTs 212 

334221 CH22_FGENES.360_1 212 

331374 AA442134 Hs.70573 ESTs; WeaMysimBarto HINT PROTEIN [H. 212 

304473 AA428343 Hs.140 immunogtobufin gamma 3 (Gm marker) 212 

328907 CH.08Jsgi|5868493 212 

319448 R05539 Hs.108738 ESTs 212 

333676 CH2*_FGENES.247_3 212 

324767 AA630931 Hs^4348 Homo sapiens mRNA; cDNA DKFZp434P0235 (f 212 

318585 Z43405 EST duster (not in UniGene) 212 

331732 AA251192 Hs.177708 ESTs 212 

329553 CH.10js2gi]3962492 2.12 

336910 CH22_FGENES.343^ 212 

326959 CR2Lhsgi|6469836 212 

305417 AA725228 EST singleton (not in UniGene) with exon 211 

301573 AI150328 Hs.226402 ESTs; Weaxty similar to mitochondrial ci 2.11 

326935 CR21_hsgP04446 211 

335176 CH22.FGENES.504J 211 

337210 CH22_FGENES.603-5 211 

311284 AW027025 Hs.239262 ESTs 211 

330240 CH.05_p2g1|6671858 211 

327463 CH.02_hsglI60D4455 211 

332938 CH22.FGENES.41J 2.11 

332785 CH22.FGBIES.1J 2.11 

301035 AI358105 Hs.123164 ESTs 21 

305712 AA826701 EST singteton (not in UniGene) with exon 21 

318651 AW003150 Hs.240165 ESTs 21 

302753 M74299 EST cluster (not in UniGene) wifii exon h 21 

334635 CH22LFGENES.417-2 2.1 

319447 AA456745 EST duster (not in UniGene) 21 

301204 AW008544 Hs.239994 ESTs 21 

333950 CH22 F6ENES.303_6 21 

325947 CH.16_hsgi|5867138 2.1 

337683 CH22.EWfcAC000097.GENSCAN.76-1 21 

328962 CH.08_hsgi|6456775 2.1 
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336655 CH2Z.FGENES.34-3 2.1 

336596 CH22_FGENES.163J Z1 

330486 M13755 Hs.833 Inteiferon-stinuilated protein; 15 kDa 2.1 

314356 AA531607 Hs.125143 ESTs 2.09 

314976 AA524725 H3.162108 ESTs 2.09 

336650 CH22J=GENES.29-£ Z09 

339026 CH2Z_DA59H1B.GENSCAN.22-6 2.09 

302395 AW297357 Hs.114606 ESTs 2.09 

323280 AI910263 EST cluster (not In UniGene) 2.09 

338857 CH22_DJ32I10.GEWSCAN.1-1 2.09 

335374 CH22_FGENES543.12 Z09 

308766 AI808510 EST singteton (not in UniGene) with exon 2.09 

331027 N48584 Hs.6168 KIAA0703 gene product Z09 

337853 CH2^_EIVfcAC005500.GENSCAN^7.1 2.09 

302498 NM.002991 EST cluster (not In UniGene) with exon h Z09 . 

312607 A1337440 Hs. 169375 ESTs 2.09 

314309 Z44049 Hs.184352 ESTs; Weakly similar to cDNA EST EMBLD3 2.09 

311695 A1142078 Hs.135562 ESTs 2.09 

333280 CH22_FGENES.126_2 Z09 

333518 CH22_FGENES.173_3 Z09 

337199 CH2Z.FGENES.5B3-1 1 2.09 

337819 CH22_EMAC005500.GENSCAN.13-9 2.08 

300546 AA214450 Hs.250913 ESTs 2.08 

322577 AA354452 Hs.59075 ESTs; Wealdy similar to WD40 protein Cla 2.08 

338028 CH22_FGENES.67^.1 ZQ8 

300238 AI394673 Hs.254030 ESTs 2.08 

307429 AI243573 EST singleton (not in UniGene) with exon 2.08 

326444 CH.19_hsgl|5BS7385 2.08 

310641 AI345597 Hs.254727 ESTs 2.08 

337833 CH2Z_C20H12.GENSCAN.32-1 Z08 

336008 CH2Z.FGENES.K8_6 Z08 

339030 CH22_BA59H18.GENSCAN.24-1 Z08 

333952 CH22_FGENES.303_8 Z08 

329149 CHX.hs gP68685 Z08 

335192 CH22_FGENES.507_7 Z08 

308225 AJ557713 Hs.177592 rfcosomal protein; large; P1 Z08 

330519 M94172 Hs.69949 calcium channel; voltage-dependent; I ty Z08 

331809 AA402482 Hs.97312 ESTs Z07 

324837 AJ003669 Hs.246171 ESTs Z07 

332608 D00749 Hs.36972 CD7 antigen (p41) 2.07 

327291 CH.01Jisgi|5867483 2.07 

315936 AW069807 Hs.247094 ESTs; Moderately sirnflarto ID ALU SUB Z07 

317917 AI143593 H&129419 ESTs 2.07 

328S74 CH.07_hsgi|5868254 2.07 

338654 CH2Z_EMAC005500.GENSCAN.46fc55 Z07 

320828 AJ012590 Hs.194728 hexose-&*hosphate dehydrogenase {glucos Z07 

337896 CH2Z_EMAC005500.GENSCAN.56-3 107 

335310 CH22_FGENES.532_3 Z07 

300076 AW074835 Hs.145223 ESTs Z07 

303588 AU046182 EST duster (not In UniGene) with exon h 2.07 

328848 CH.07JisgiI6381921 Z07 

318723 C1806O EST duster (not In UniGene) 2.07 

335352 CH2Z.FGENESJ39.5 Z07 

339316 CH22.BA354112.GENSCAN.22-15 Z06 

335873 CH2ZJGENES.631J 2.06 

335261 CH2ZJ=GENES^20_2 2.06 

322032 AL079807 EST duster (not In UniGene) ZOO 

308771 AI809301 EST singleton (not in UniGene) with exon Z06 

310024 A1252661 Hs.145224 ESTs Z06 

320555 R36212 Hs.235534 ESTs 2.06 

319314 T74062 EST duster (not in UniGene) Z06 

334642 CH22_FGENES.417_9 Z06 

335767 CH22_FGENES.607_1 Z06 

336159 CH2ZJGENES.707.3 2.06 

336358 CH22_FGENES.818_1 Z06 

334687 CH2Z/GENES.419J2 Z06 

339389 CH22_BA232E17.GENSCAN.4-7 2.06 

335898 CH22_FGENES.635J5 Z06 

328847 CH.07_hsgii6381920 ' Z06 

313431 W91684 EST cluster (not in UniGene) Z06 

313270 AI374993 Hs.159611 ESTs 2.06 

339211 CH22.FF1 13D1 1 .GENSGAN.WS Z06 

333860 CH2Z.FGENES290J9 Z06 

308952 AI868157 Hs.224226 EST 2.06 

305471 AA743947 EST singleton (not in UniGene) with exon 2.06 

300619 AA991438 Hs.233293 ESTs 2.06 
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302962 


AI693349 


Hs.228981 


EST 


2.06 


332446 


AA112799 Hs238756 


ESTs; Weakly similar to unknown [H^aple 


206 


334972 






CH2^FGENES.468J 


205 


330196 






CH^5jj2glj6165140 


205 


304754 


AA579795 




EST singleton (not in UniGene) with exon 


2.05 


309726 


AW248521 Hs.195188 


gtyceiaJdehyde-3-phosphate dehydrogenase 


205 


333939 






CH22_FGENES.301_5 


2.05 


304836 


AA587008 




EST singleton (not in UniGene) with exon 


205 


302087 


AA324163 




EST cluster (not in UniGene) with exon h 


2X15 


308424 


AI650714 




EST singleton (not in UniGene) with exon 


2.05 


304347 


AA176914 




EST singleton (not in UniGene) with exon 


205 


333141 






CH22.FGENES.85J 


2.05 


310573 


AW292180 Hs.156142 


ESTs 


205 


337565 






CH2£_C65E1.GENSCAN.1-11 


205 


304295 


AA084082 




EST singleton (not in UniGene) with exon 


2.05 


326824 






CH.20Jisgi|5867553 


205 


32S443 






CH.19_hsgij5867385 


204 


339012 






CH2LPA59H18.GENSCAN.1W 


2.04 


337384 






CH22.FGENES.745-1 


204 


332326 


T79623 


Hs.111787 


ESTs 


204 


303706 


AW501525 




EST duster (not in UniGene) with exon h 


204 


336046 






CH22_FGENES.679_8 


204 


301770 


R05887 




EST duster (not in UniGene) with exon h 


204 


326726 






CH20Jisgi|5867597 


204 


330485 


M11186 


Hs.1 13216 


oxytocin; prepro* (neurophysln 1) 


204 


332956 






CH2LFGENES.48J3 


204 


300021 


M97935 




AFFX control: STAT1 


204 


306872 


AI086920 




EST singleton (not in UniGene) with exon 


2.03 


302744 


L03151 




EST duster (not in UniGene) with exon h 


203 


338507 






CH22.EMAC005500.GENSCAN.390-11 


203 


334020 






CH22LFGENES.317J 


103 


333870 






CH22_FGENES.291_3 


203 


330552 


U40223 


Hs.248157 


pyrimidinerglc receptor P2Y; G-protein c 


203 


335486 






CH22_FGENES.570JB 


203 


339374 






CH22_BA232E17.GENSCAN.2-5 


203 


328384 






CH.07_hsgi|5868392 


2.03 


334690 






CH^J=GENES.420_3 


2.03 


310318 


AI733942 


Hs.145338 


ESTs 


2.03 


325893 






CR16_hsgi|5857088 


2.03 


331373 


AA435513 Ks.178170 


ESTs; Weakly similar to DUAL SPECIFOTY 


2.03 


329784 






CH.14_p2gi|5912597 


203 


335087 






CH22JGENES.488J1 


2.03 


310582 


AI336563 


Hs.254585 


ESTs 


2.03 


332611 


R06751 


Hs.1600 


chaperonin containing TCP1 ; subuntt 5 (e 


203 


339258 






CH22_BA354l12.GENSCAN.8-3 


203 


336851 






CH2LFGENES574-1 


203 


305596 


AA780664 Hs.8734 


ESTs; Moderately sirdar to llii ALU CLA 


203 


330364 






CHXp2gipi26882 


203 


302940 


AL137619 




EST duster (not in UniGene) with exon h 


2.03 


317349 


AA923657 Hs.126359 


ESTs; Weakly similar to (ill ALU SUBFAMl 


203 


309869 


AW300314 




EST singleton (not in UniGene) with exon 


103 


333422 






CH22_FGENES.147_2 


203 


325233 






CH.10Jisgil6381943 


2.03 


330586 


U77968 


Hs.79564 


neuronal PAS domain protein 1 


103 


336725 






CH22.FGENES.88-1 


102 


334157 






CH22LFGENES340_7 


102 


303357 


AW006352 Hs.159643 


ESTs; Weakly simflar to MID [H.sapiens] 


102 


328533 






CH.07_hsgl|5868482 


102 


309210 


AB62817 




EST singleton (not in UniGene) with exon 


2.02 


327412 






CH.Q2Lhsgi}5867750 


102 


333172 






CH22_FGENES.94_7 


102 


334869 






CH22.FGENES.447J 


102 


301047 


AA971465 Hs.1 16136 


ESTs 


102 


329394 






CHXJisgiI6478817 


102 


301736 


F12128 




EST duster (not in UniGene) with exon h 


2.02 


335591 






CH22_FGENES.581J0 


102 


338234 






CH22LEMAC005500.GENSCAN^60.7 


102 


334433 






CH22J=GENES.385J 


102 


334904 






CH22_FGENES.452_18 


202 


318443 


A1939323 


Hs.157714 


ESTs; Weakly similar to NEUR ACETYLCHOLI 


300151 


AI243445 


Hs.189654 


ESTs 


101 


310348 


AI478563 


Hs.145519 


ESTs 


101 


310898 


AW3986B 


Hs.165742 


ESTs 


101 


332860 






CH22J=GENES.27_3 


101 


301699 


AI879117 




EST cluster (not in UniGene) with exon h 


2.01 


332554 


W96450 


Hs.23111 


phenytaianire-tRNA synthetase* 


101 
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327994 CK06Jisgq5868218 2.01 

315613 AW137420 Hs.192311 ESTs 2.01 

335356 CH2^FGENES541_3 2.01 

334028 CH2^FGENESJ18J 101 

335277 CH2^FGENES^23J 2.01 

308657 AI749855 Hs.236497 EST; Weakly similar to GLANDULAR KALUKR 2.01 

305913 AA876109 EST singleton (not In UnlGene) with exon 2.01 

323681 AW247730 Hs.102548 glucocorticoid receptor DMA binding feet 2.01 

333533 CH22_FGENES.175J0 2.01 

328753 CH.07_hsgI|5868298 2.01 

302397 L01694 H&21 1523 guanine nucleotide binding protein (G pr 2.01 

304643 AA526586 EST singleton (not In UnlGene) with exon 2.01 

333065 CH2a.FGENES.75J 2.01 

316192 AA904441 Hs.221286 ESTs 2 

302533 L36149 Hs^48116 chemoWne (C motS) XC receptor 1 2 

312988 AA813689 Hs.123436 ESTs 2 

333612 CH22J=GENES217J 2 

333615 CH22J=GENES217J0 2 

316085 AI027959 Hs.132300 ESTs 2 

337936 CH22„EMAC005500.GENSCAN.85-7 2 

330972 H18467 Hs.118S83 ESTs; Weakly similar to diaphanous 1 [H. 2 
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Table 20: B survivor vs Mets 



- Up in Mets 



PCT/US02/06001 



Pkcy: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigenc Title: Unigene gene title 



Pkey 


ExAccn 


UniGJD 


Complete Title 


Ratio BS/Met 


316625 


AA780307 


Hs.122156 


ESTs 


0.28 


316076 


AW297895 


Hs.116424 


ESTs 


0.3 


315943 


AA699756 


Hs.117335 


ESTs 


038 


317198 


AI810384 


Hs.128025 


ESTs 


038 


320082 


AA487678 


Hs.189738 


ESTs 


0.39 


313510 


A1147291 


Hs.154006 


ESTs 


0.39 


323683 


A1380045 


Hs^25033 


ESTs 


039 


316558 


AW402877 


Hs.90372 


ESTs 


0.4 


310264 


AI915771 


Hs.148867 


ESTs 


0.4 


314945 


AW276866 


Hs.192715 


ESTs 


0.41 


313403 


W86995 


Hs.1 13157 


ESTs 


0.42 


321505 


H73183 


Hs.129885 


ESTs 


0.43 


312171 


AW444619 


Hs.138211 


ESTs 


0.43 


324585 


AI823969 


Hs.132678 


ESTs 


0.44 


316695 


AA809844 




EST duster (not in UniGene) 


0.44 


319818 


AA825819 


Hs.1 36952 


ESTs 


0.44 


337522 






CH22.FGENES.819-1 


0.45 


324714 


AA574312 


Hs.245737 


ESTs 


0.45 


315060 


AA551104 


Hs.189048 


ESTs 


0.46 


300546 


AI026836 


Hs.114689 


ESTs 


0.47 


304483 


AA431441 




EST singleton (not in UniGene) with exon 


0.47 


313095 


AM22367 


Hs.163533 


ESTs 


0.47 


306501 


AA987294 




EST singleton (not in UniGene) with exon 


0.47 


329086 






CHJUisgil5868604 


0.47 


320176 


AA167566 


Hs.133325 


ESTs 


0.47 


320418 


AI674461 


Hs.199638 


ESTs 


0.47 


302982 


W92391 


Hs.198222 


ESTs; Weakly slmQarto C2H2-typezincf 


0.48 


315609 


AW207535 


Hs.224012 


ESTs 


0.48 


317056 


AA904908 


Hs.250643 


ESTs 


0.48 


314361 


AL038765 


Hs.161304 


ESTs 


0.49 


315169 


A1371390 


Hs.158667 


ESTs 


0.49 


323743 


AA324992 


H&257168 


ESTs 


0.49 


313903 


AW167439 


Hs.190651 


ESTs 


0.49 


315061 


AA551196 


Hs.188952 


ESTs 


0.49 


300969 


AI140799 


Hs.76230 


ribosomai protein S10 


0.5 


331950 


AA454595 


Hs.99369 


ESTs 


0.5 


315076 


AI623817 


Hs.168457 


ESTs 


0.5 


300975 


A1283548 


Hs.149668 


ESTs 


0.5 



i 
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T ABLE 1-20A 



Table 1-20A, shows the accession numbers for those pkeys lacking unigenelD's for Tables 
1-20. For each probeset we have listed the gene cluster number from which the 
oligonucleotides were designed. Gene clusters were compiled using sequences derived from 
Genbank ESTs and mRNAs. These sequences were clustered based on sequence similarity 
using Clustering and Alignment Tools (DoubleTwist, Oakland California). The Genbank 
accession numbers for sequences comprising each cluster are listed in the "Accession" 
column. 



Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 

Pkey CAT Number Accesssion 

108446 112224 J AAQ85383 AA126091 AA074174 AA075373 AA079120 AA070831 AA075978AA075372AA128503 
108474 116896J AA1 15179 AA079667AA1 15897 AA079771 

100835 10605.34 BE259039W29128AW410299 X72990BE246492NM_005243 X66899 AI909006 AW248151 AL031 186 AA01 2966 BE273549 
BE311429 BE253102 Y07648 BE536102 BE256663 BE261240 BE312156 BE618412 BE257322 BE620446 AW606629 
AA376777 AA325384 BE256808 BE251039 BE257578 BE275352 AA357169 AW403562 AA204995 M093259 W95953 
BE256279 BE336683 BE252465 BE251266 AA380754 BE294942 AA380941 AA380999 BE297164 BE249995 BE294719 
BE295372 AI270673 BE305132 BE563752 BE295357 AI525421 BE263980 AA057505 AA02091 5 BE266318 BE208948 
AJ474020 BE296420 BE297374 BE408545 BE019366 BE407372 BE266180 BE279437 R58233 T19567 BE300738 AW381 179 
AA357571 AW361285 AA436908 AA301019 AA301Q22 N20202 BE408777 BE548638 BE167415 AA071260 BE088429 
BE280092 W231 17 T19568 R51681 AW402216 W22784 BE1 85607 AI457224 BE544120 AL134874 S72820 AA375079 D51 319 
AW818280 BE514686 AW853024 BE563744 AA300469 TD7592 BE6221 90 BE272834 W21781 BE315450 BE542367 BE393120 
AA988441 H55137 BE562296 BE622502 BE395960 AA329733 AA332348 AI768317 AA456866 AI497832 AW878437 AA857042 
U18018 BE621418 AI818790 AI949507 BE397693 AI885545 AI858854 A1355147 BE169028 S62138 AW732191 AA856891 
BE266060 X71427BE268557AH^OAW001288AJ799B34A!823498AA07134^ 

U35622 BE314249 BE264915 AI538591 AI538385 AW090025 BE384754 AI8B8689 AW778800 AI925273 AA075797 AW949130 
AV660275 AW438697 A1587137 A1524121 AA806249 AW628247 AA808241 AI244388 AI761 125 AW1 17672 AA911782 
A1129250 AA654447 H55291 BE258050 BE206162 W95867 AA857187 AI871378 AI560103 AW103827 A1220929 AW149949 
BE465561 AI302857 AW168841 D82190 AW249814 A1623432 AI687358 AW951077 R51592 W60458 A1092863 AW474693 
D12765 AI911646 D82208 082187 AW074031 AI358527 AW338497 AA970893 AW072573 AA205364 AI858886 AA012830 
AW148763 AI863056 AA548656 BE250325 AI016994 AI864005 BE046122 AI497746 C75340 R58896 082141 AW168240 
C19048 AI741090 D29465 AI222365 AA948288 AI583522 AW572212 A1091290 AA582727 AA579897 AA570629 W60883 
AW516989 AL038160 AA577334 A1865872 AA994043 AA922583 AA464778 AA209178 AI829479 AI370235 BE246529 
AA384177 AA456255 AI699730 W60554 AL035744 AA662042 R32756 AI886886 AA993087 AI289479 AA627840 AA464184 
AI6195Q3 R32755 AW07535B AI432315 AA457024 AA020865 R92132 AA454629 AA746059 AA454643 AA456240 AA826984 
BE163738 A1806470 AI991074 AI802560 AA587095 AA558714 AA968521 N87780 AI538246 N71794 AV661738 AI368903 
AA362570 AI989445 A1674962 S75762 BE245204 AA975296 D20123 AW005704 AA693328 AA582270 AI918474 AW205707 
AI696299 AA220990 AA101538 T29030 H27201 AW262526 AI610530 AA126840 AA 126790 X92120 AW367868 BE299644 
BE299451 AA476561 BE300044 AA134363 BE295222 AA307504 N42337 AA319098 N395Q2 AW964461 N57241 BE299049 
N86332 R51 1 56 AA085859 T75212 AA133939 AA147129 AA156161 BE543953 BE538848 AA133676 BE299745 AA135050 
AA218535 AW40S401 AW411287 BE41052B C01410 NMJ04083 BE314959 AA836413 AA085862 AW024370 AA471059 
AW467508 AA001025 AI828231 AA633221 195517 AA147038 AA476447 AW027O12 AW078627 BE513200 AI192297 
AA686279 AW081808 AA316185 AA010506 A1269929 W93139 AI682935 AA609555 AA376028 Al 093877 AA999997 AA730698 
AI143923 AW575315 AA890550 AA494353 AW576601 AI796336 AA826130 AA609207 AI539618 A1088539 A1089090 AA825505 
AA632978 AA015892 AW204713 AA156495 AA824613 AA133630 N29826 AA527476 AI633352 T27908 AA134364 AA133940 
AW043601 H37775AA772375AA057871 AA047888 AA054225 HB6568 AA001511 H25718 AW189507 AA165589 AA054433 
H85549 AA16548S AA058972AA454911 AA464064AA493802 AA428253 R85508 AW302469 AJ61 1812 BE 162582 F11073 
T95518 N26811 AI783929 H40669 AW61 1745 AI658803 R51042 R45276 AA528386 AA782875 AW880218 AL138391 AA314536 
AW949338 AA149466 AA149552 AI346513 AA216776 BE349131 AW007654 AI141803 AA622688 AJ1B5131 AW057635 
AA1 01539 AA627986 H27202 A1536847 W93084 AI9731 48 AI246788 AW5721 08 A1459414 AA454835 AA612707 AA430746 
AI084991 AA010400 AA856636 AA463928 A1248310 R07170 AA834033 D12244 AI655670 AA054350 AA639480 AI702067 
AI475389 

100643 3931J NMJW5032 M34427 AA332167 AW409711 AL1 19718 BE297581 BE299855AA082284AA226855M149568AW391953 
M22299 BE163594 AW847881 AW366993 BE142871 AW847885 AW604137 AW847753 AW847886 AW376442 U48350 
AW607478 AA373011 AA334080 BE294177 AL121355 AA302236 BE540666 BE170588 AA346884 BE541512 AA226818 
AA082001 AA366490 AW604122 AA205784 AW607791 BE168496 AA058497 T64373 BE165633 AW802804 AW847878 
AA187408 AA088397 AI751745 AA344103 AA034463 AI906008 AA363580 AA379193 AI332642 AI143569 W52748 W52754 
AA385532AA085967 F05943 AA363422 AA133444 AA133477 AA029541 N48387 N83348 AA376066 M147671 W70187 
AA316255 BE174987 AA452776 AA089605 AL047776 BE162673 H39532 BE168406 AA357654 AA328728 AW813442 D57844 
AW839748 AW839663 D57357 AA334536 AW268674 AW950788 AW409888 A1160544 D57821 AW664382 D25884 AI755101 
AW130365 AI609094 AI984064 A1806523 AA492516 AI755258 BE157210 AA374884 AI983923 AI831088 AA706501 AI754957 
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AI688651 A10B8623 A13361 14 N38752 T56004 AAB45200 AA858377 BE157397 AW0B9347 AA045366 AW31691 8 AW130372 
AI355398 BE157336 AI751746 A1375820 AA129935 W60002 N24781 AI805924 W60Q09 AA044283 AA121 161 AI539277 
AA301885 AW019944 AA133445 M10110B AA033559 W70050 AA617751 A1986261 AI023234 D82235 AAQ85846 AW754181 
D82093 D82100 AA147653 AA600256 057884 AI753S82 AI568050 AI146490 AW302280 AI433051 AA329188 AW572150 
AW1 66345 AI337981 AA778973 N67577 AA227207 AA838281 C06190 AL046897 AI217662 AI752979 AW627538 AI127171 
AI440461 T64184 AA845190 AA227111 AA877394 R6Q962 AA505646 AA770545 AI896264 AA953747 AA904094 AA058318 
D57026 T17158 AA578545 AW085082 BE148939 AW815069 BE152843 BE149068 BE149036AW815073AW753691 BE149040 
AW815065 BE152842 BE149072 AW753692 AW815055 BE152837 BE152849 BE152840 AW815070 BE152829 BE152846 
BE148972 BE149042 BE149074 AW753668 BE152832 BE152841 BE149082 BE149050 AA347261 BE152852 BE152847 
R65797 F02189 AA483448 AI954410 AA8S5375 BE152835 BE152838 BE152839 T17300 BE152844 BE152833 BE152834 
AA029542 AI567601 A1362353 BE162140 AI381384 BE152851 D57038 D57043 A1418363 AA133478 BE149051 BE149083 
BE152850 BE149052 BE149084 AA886686 BE149064 BE149032 AA044093 AA129934 AA303976 BE157211 AA187291 
BE152830 AA046552 BE149047 BE149079 BE149033 BE149065 BE149044 BE149076 BE149053 BE149085 BE149034 
BE149066 BE149048 BE149080 BE149038 BE149070 BE149045 BE149077 

100570 22023.1 AA332178 BE259177 BE545625 T09105 S62076 M16424 NWL000520 BE244309 F13516 BE251567 BE514981 AL1 19537 
AA336739 BE261801 AA278642 N32708 T77034 W24621 W42478 AW630382 AW856214 AA134234 M13520 BE379212 
AA287459 BE019379 BE297192 BE162970 AW405668 AW403322 BE272280 BE20S703 BE304428 BE162807 BE162828 
BE162687 BE078944 BE163025 BE162878 BE162909 BE162898 BE162791 BE162880 BE073563 BE163086 BE162898 
BE162770 6E073565 BE162908 BE162913 BE162947 BE162803 BE262199 BE162811 BE080897 BE315095 AW2Q6024 
AI291054 BE087364 AL046839 AA304422 AA847660 AA669876 BE392765 AI557798 AW026644 AW151258 AA998314 
AI828860 AI571 158 T61 941 AW103503 AW172698 AI9231 1 5 AI823709 T621 67 AW771381 AW151782 AI799284 AW242271 
AA128031 BE261306BE312241 AI6748B0 BE261057 AA6306B4AA831305AI139546AWD82447AA91^54AA916855T05970 
AA599395 AA921680 A1244674 AKW1920 AA424998 A1362999 W42543 T51260 A1362486 AI699366 AI827925 AI027381 
AKJ27370 AI209049 AA782220 AI334014 AI279051 AI21771 1 AI674210 A1193370 AI701683 T23782 AI927545 AI784291 
AA128007 A1370630 AI972736 AAB53763 N32379 AI916746 AA639633 AA907603 AM79452 AW950971 T28985 A1685825 
AA563654 AA745291 AW089417 F10858 AI354227 R38108 AI668647 AA994088 AI740910 AW880973 AI739410 AI480346 
N78987 AI473892 BE162903 BE254430 BE260426 AA650012 AW006426 

100673 21517 J AW403342 AW2489B6 BE561709 AA357312 BE311834 BE389496 BE294887 AW732696 BE047868 AI702383 BE019155 
AI702367 BE408966 BE280458 BE313759 BE513492 BE535404 BE280258 AC005263 NM J07165 L21990 AW732711 
AI564920 AW249094 BE265365 AW607186 AW607346 BE005217 H27211 U46230 BE260066 BE207043 BE546782 AW248659 

108559 41469.9 AAD85228 AA085161 

108569 118606.1 AA082885 AA1 14265 AA085398 AA1 13184 

100700 17137.1 AA932794 BE540417 AW409802 AW410765 BE296651 BE294197 BE164813 AW381886 AW381806 AL048654 AW403058 
BE207228 AA464654AW966967AA326831 BE407277 BE408669 AA476527 AA1 1 5576 AA359697 AA476357 AA449939 
BE263719 AL045304 W21442 R2B919 BE395990 AA252273 AI346812 BE538487 AA507160 W93950 N42025 AI088439 
AL134931 AA031524AI887287 AW470017AM76423AA464553AW410766 AI569421 AA577476 AI248935 AI912371 
AW615674 AA824237 AA807746 AA827377 AA890268 AA476309 AI086424 AW409698 AA031553 AW451901 AI520934 
AW050554 R49825 N30302 AA532541 W94004 N93402 AA1 15549 AA331202 L25665 NM.005275 AA436745 AI122671 R49779 
R18508 

100734 35197J AI607481 AI500404 AI092260 BE346962 AI143675 AA772399 AA772398 AI368565 AI379172AI083781 AIB71363 AA843793 
NM-000141 M87770 X52B32 M55614 271929 W05259 AA548551 AI498743 BE081295 BE162251 F05643 AI127918 H83199 
W07463 BE551725 R28404 AW206461 AW590506 A1885536 N69800 R93496 R25381 AA443093 AI143063 A1284647 AI703144 
AA309032 AA309031 AW949426 AW949428 AI638387 AI638356 R77173 R38513 T89263 F01900 H87979 R70205 AA664355 
AA235910 AA033657 AW36212 N50463 AI433805 AK381876 AI421090 AW020818 AI559529 A180742O BE154202 AA889062 
AA723410 BE537575 AA236812 AI218552 A1264866 A1290617 AA424365 AA424505 AW073347 AA032183 A1142488 N55322 
AI884363 A1336070 N67307 AA608928 T94993 AW514184 AA724695 N66630 AI379638 AI274671 AW628470 AA235346 
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328236 c_6 Jk 

335455 CH22_2806FG.562_15_UNKJ 
328243 c_6_hs 

335468 CH22_2819FG_567_4JINK_EM 
335470 CH22J821FG_568.3JJNK.EM 
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335482 CH22J834FG.570J1.UNKJE 
335485 CH22_2837FG_570_17JJNK_E 
328271 c_6_hs 

328276 c_7_hs 

328277 C.7J1S 
328282 c_7_hs 
305403 AA723748 

335517 CH22J872FGJ71.34JJNK.E 
328305 cJJ& 

335523 CH22_2B78FG_572_3 _HNK_EM 
335527 CH22_2882FGJ72_7_UNKJM 
305443 AA736653 
328314 c_7_hs 

335536 CH22_2891FGJ74_2JJNieEM 
305454 AA738413 
328328 cJJs 
305464 AA742425 

335565 CH22_2921FG_579J_LINK_EM 

335566 CH22_2922FG_580_1_LINK_EM 
305486 AA748889 

335585 CH22_2943FG_J581_24_UNKJ 
335587 CH22L2945FGJ81 J6JJNKJE 
335593 CH22J951FG.581 _32JJNK_E 
335606 CH22_2964FG_582_3J-NK_EM 
305536 AA770682 
305547 AA773111 

335634 CH22_2S94FG_584_14JJNK^E 
328420 C_7_te 
328428 c 7 hs 

335651 CH22_301 1 FG.590 J JJND^EM 

335652 CH22_3012FG_590_3JJNK_EM 
328436 c 7_hs 

328444 q_7Jis 

335667 CH22J027FG_530J8_UNKJE 
328462 c_7_hs 
328467 c.7_hs 

335687 CH22_3048FG_596J_UNKJEM 
335690 CH2L3051FGJ96JJJNKJM 
328474 cl7 Js 

335692 CH22.3053FG_596JJJNK.EM 

335693 CH22.3054FG - 596_8_UNK.EM 
328484 c_7_hs 

335700 CH22J061FG.598JJJNIUM 
305621 AA789095 
305632 AA805276 

335720 CH22.3081FG_599_23JJNK_E 

335721 CH22JQ82FG_599_24_LINK_E 
328504 C.7J* 

328506 c_7_hs 

328507 c_7_hs 

335733 CH22_3095FG.601JJJNICEM 
335739 CH22J102FG_601JOJJNICE 
335745 CH22_3108FG.601.16_UNK_E 
335747 CH22J111FG_601 JOJJmj. 
335750 CH22.3115FG_6Q2_4_UNK_EM 
335755 CH22J3122FG_604_4JJNK_EM 
328544 c_7_hs 

335768 (M22_3137FG_607_2JJNK_EM 
305686 AA812726 
328552 c_7_hs 

335774 CH22J143FG.607JOJUNK-E 

328557 c_7_hs 

328558 C.7JIS 

335777 CH22J146FG_607J3JJNieE 
305697 AA814956 

335782 CH22_3151FGJ09.4JJNK_EM 

335783 CH22_3152FG.610.3JJNK.EM 

328569 c_7.fts 

335787 CH22_3156FG.61U_LINK.EM 

328570 c.7Jis 

328581 c_7Jis 

328582 c.7_hs 
328592 c_7_hs 

337011 CH22_4876FG_427.6_ 
337023 CH22_4894FG.433.12. 
337032 CH22_4910FG.438_3_ 
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337069 CH22J967FGJ48JL 

337092 CH22J016FG_465J2_ 

337093 CH22_5017FG_465_18_ 

337094 CH2^5018FG_465_19.. 
337097 CH2L5028FGJ71J. 
305700 AA815428 

335806 CH22.3178FG.616.8JJNKJM 
335817 CH2^3189FGJ18JJJNieEM 
328607 c_7Js 

335827 CH22J200FG 620JJUNK.EM 

335831 CH2*_3204FG_620J_UNK_EM 

335832 CH22J205FG_620_6_UNK_EM 
328620 cl7Jb 

328624 <L7_hs 
328636 c.7 Je 

335863 CH22J238FG.629JJJNKJM 
305782 AA844730 
305787 AA845035 
328662 c_7_hs 

335895 CH22.3272FG.635.3.UNK.EM 

337100 CH22_5031FG_472JL 

337114 CH2L5060FG_494_17_ 

337121 CH22_5096FG_519_1_ 

337132 CH22L5112FG_526_3_ 

337168 CH2L5188FG.582J8. 

307085 AJ160868 

337170 CH22_5190FG_564J_ 

337172 CH22_5192FG.565^_ 

307090 AI161024 

305803 AA846052 

305808 AA853958 

305816 M854776 c 

335902 CH22J279FGJ35J0JJNKJ 

335920 CH22L3297FGJ36J6JJNK.E 

305841 AA860346 

305867 AA864572 

335956 CH22_3334FG.647J.UNK_DJ 
305877 AA865649 

335968 CH22_3347FG_652JJJNKJXJ 
335971 CH22^3350FG_652_4JJNK_DJ 
335975 CH22J354FG_652AUNKJ>J 
335980 CH2t.3360FG_653JJJNK.DJ 
328768 q_7Jb 
328770 c_7_te 

335993 CH22L3373FG_656_6_UNK.DJ 

335998 CH2^3379FG_656_16_LINKJ) 

335999 CH22L3380FG_657JJL!N»eOJ 
328791 Q_7Jis 

337203 CH22_5256FG_591_3_ 

337204 CH22L5261FG_595_L 
307120 AI184343 

307140 AI185762 
307177 AI188864 
305903 AA873085 
305925 AA877883 
328803 C_J_hs 
330002 c16_p2 
328810 <L7_hs 
328820 cJJs 
330021 C16JJ2 
305967 AA886428 
328835 CJJts 
305971 AA686874 
328841 <L7Jis 
305984 AA887654 
328851 cj_hs 
328859 cj_hs 

330057 C17JJ2 

330058 c17j>2 
305999 AA889603 
307215 AI193189 
307262 AI202100 
307318 AI208577 
307380 AI222985 
307433 AI244895 
307437 AI245683 
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307556 A1281651 
307556 AJ281998 

337645 CH2^.5960FG_UNK.EMAC0O 
337657 CH22_5976FG_UNK^EMAC00 
307574 AI283549 
307588 A1285535 
307592 AT285739 

337685 CH2^6Q20FG_UNK.EMAC0O 
337695 CH21.6033FG_UNK_EMAC0O 
337697 CH22_6037FG_UNK_EMAC00 
307606 A1290006 



307629 AJ30Q246 

307642 AI302103 

307643 AB02124 
307646 AB02236 
307703 AJ318588 
307728 AI335557 
307752 AI339447 
307799 AI351112 
309005 AI884454 
307864 AI367417 
307877 A1368880 
307899 AI380270 
309108 AI925949 
309169 AI949216 
309181 A1951727 
307904 AI381019 
307912 AE382224 
307918 AI383496 
307954 AW19692 
307961 AI421059 
307992 AI434166 
309206 A1961962 
309233 AI971416 
309245 AI972447 
309247 AJ972768 
309275 AI989570 
309324 AW015373 
309333 AW025709 
309343 AW028652 
309351 AW057547 
309403 AW032954 
309533 AW151131 
309592 AW172384 
325271 c11Ju» 
325285 c11_hs 
325289 clljis 
309600 AW182066 
309605 AW182800 
309759 AW268822 
309767 AW271805 
309815 AW292760 
309859 AW298760 
302681 31257J 
302683 31326J 
309958 AW444488 
309977 AW451663 
309985 AW452919 
302727 32493J 
302747 328I3J 
304022 T02990 

304055 R07994 

304056 R08577 
304060 T61464 
304125 H40976 
304127 H42981 
304134 H54627 
304195 N35382 
302952 39444J 
302996 41195J 
304269 AA069029 
304324 AA137045 
304330 AA157834 
304424 AA293494 
304465 AA421948 



X97550X97552 
X85153T63701 



L10141 L10151 L10148L09095 
AF062275L03830 



AF103179 U82961 
AF054663AF124197R70292 
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304467 
304480 
304485 
304487 
304559 
304575 
304605 
304612 
304623 
304635 
304667 
304674 
304675 



304696 
304707 
304731 
304734 
304745 
304746 
300009 
306012 
306053 
306081 



304811 
304813 
304817 
304833 
304841 
304887 
306137 
306180 
306183 
306193 
304918 
304968 
306200 
306220 
306221- 
306300 
320789 
306351 
330435 



AA424703 

AA430373 

AA434076 

AA434241 

AA488050 

AA496437 

AA513225 

AA514207 

AA521331 

AA523976 

AA535602 

AA541735 

AA541740 

AA554263 

AA554758 

AA564846 

AA576085 

AA576428 

AA577771 

AA577793 

AA894560 

AA896989 

AA905312 

AA908472 

AA908609 

AA584361 

AA584540 

AA584712 

AA586504 

AA587541 

AA599355 

AA916176 

AA922503 

AA922622 

AA923457 

AA6Q2697 

AA614308 

AA926816 

AA928363 

AA928686 

AA937573 

252516J 

AA961356 

41165J 



330436 10605.34 



R78712AA603646R78713 

U63836 AW842139 X74956 U78550 AW840802 X74954 AW388241 AW842709 AF253321 X74955 X74370 AW353799 
BE073386 AI791962 AA587390 AW840865 

BE259039 W29128 AW410299 X72990 BE246492 NMJ05243 X66899 AI909006 AW248151 AL031166 AA012966 BE273549 
BE31 1429 BE253102 Y07848 BE538102 BE256863 BE261240 BE312156 BE618412 BE257322 BE620446 AWB06629 
AA376777 AA325384 BE256808 BE251039 BE257878 BEZ75352 AA357169 AW403562 AA204995 AA093259 W95953 
BE256279 BE336683 BE252465 BE251266 AA380754 BE294942 AA380941 AA380999 BE297164 BE249995 BE294719 
BE295372 A1270673 BE305132 BE563752 BE295357 AI525421 BE263980 AA057505 AA020915 BE266318 BE206948 
AI474020 BE296420 BE297374 BE408545 BE019366 BE407372 BE2661 80 BE279437 R58233 T1 9567 BE300738 AW381 179 
AA357571 AW361285 AA436908 AA301019 AA301Q22 N20202 BE403777 BE548638 BE167415 AA071260 BE088429 
BE280092 W231 17 T19568 R51681 AW402216 W22784 BE185607 AI457224 BE544120 AL134874 S72620 AA375079 D51319 
AW818280 BE514686 AW853Q24 BE563744 AA300469T07592 BE622190 BE272834 W21781 BE315450 BE542367 BE393120 
AA988441 H55137 BE562296 BE622502 BE395960 AA329733 AA332348 AI768317 AA456866 AI497832 AW878437 AA857042 
U1801B BE621418 A1818790 A1949507 BE397693 AI885545 AI858854 A1355147 BE169028 S62138 AW732191 AA856891 
BE266Q60 X714Z7 BE268557 AF095890 AW001288 AI799634 A1623498 AA071346 BE547662 BE261446 AI564543 BE559759 
U35622 BE314249 BE26491 5 AI638591 AJ538385 AW090025 BE384754 AIB886B9 AW778800 AI925273 AA075797 AW949130 
AV660275 AW438697 AI587137 A1524121 AA806249 AW628247 AA808241 AI244388 AI761125 AW117672 AA911782 
AI129250 M654447 H55291 BE258050 BE206162 W95867 AA857187 A1871378 AI660103 AW103827 AI220929 AW149949 
BE465561 AI302857 AW168841 D82190 AW249814 AI623432 AJ687358 AW951077 R51592 W60458 AI092863 AW474693 
D12765 AI911646 D82208 D82187 AW074031 AI358527 AW338497 AA970893 AW072573AA205364AI858886AA012830 
AW148763 A1863056 AA548656 BE250325 A1016994 A1864005 BE046122 AI497746 C75340 R58896 D82141 AW168240 
C19048 AI741090 D29465 AI222365 AA948288 A1583522 AW572212 AI091290 AA5B2727 AA579897 AA570629 W60883 
AW51 6989 AL038160 AA577334 AI865872 AA994043 AA922583 AA464778 AA209178 AI829479 AI37Q235 BE246529 
AA384177 AA456255 AI699730 W60654 AL035744 AA862042 R32756 AI8868B6 AA993087 AI289479 AA627840 AA464184 
AI619503 R32755 AW075358 AI432315 AA457024 AA020865 R92132 AA454629 AA746059 AA454643 AA456240 AAB26984 
BE163738 AI806470 AI991074 AI802560 AA587095 AA558714 AA96B521 N87780 AI538246 N71794 AV661738 A1368903 
AA362570 AI989445 A1674962 S75762 BE245204 AA975296 D20123 AW005704 AA693328 AA582270 AI918474 AW205707 
AI696299 AA220990 AA101538 T29030 M27201 AW262526 AI610530 AA126840 AA126790 X92120 AW367868 BE299644 
BE2S9451 AA476561 BE300044 AA134363 BE295222 AA307504 N42337 AA319098 N39502 AW964461 N57241 BE299049 
N86332 R51 156 AA085859 T75212 AA133939 AA147129 AA156161 BE543953 BE538B48 AA133676 BE299745 M135050 
AA218535 AW406401 AW41 1287 BE410528 C01410 ML004083 BE314959 AA836413 AA085862 AW024370 AA471059 
AW467508AA001025A1628231 AA633221 T95517 AA147038 AA476447 AW027012AW07B627 BE513200 AI192297 
AA886279 AW081806 AA316185 AA010506 AI269929 W93139 AI682935 AA609555 AA378028 A1093877 AA999997 AA730698 
AI143923 AW575315 AA890550 AA494353 AW576601 AI796336 AA826130 AA609207 AI539618 AW88539 AI089090 AA825505 
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AA632978 AA015692 AW204713 AA156495 AA824613 AA133630 N29826 AA527476 AI633362 T27908 AA134354 AA133940 
AW043601 H37775AA772375AA057871 AA04788B AA054225 H86568 AA001511 H25718 AW1B9507 M165589 AA054433 
H85549 AA165486 AA058972 AA454911 AA4B4064 AA493B02 AA428253 RB5508 AW302469 AI611812 BE1B2582 F11073 
T95518 N2681 1 AI783929 H40569 AW61 1745 A1658B03 R51042 R45276 AA528386 AA782875 AW880218 AL138391 AA314536 
AW949338 AA149466 AA149552 AI346513 AA216776 BE349131 AW007654 AI141803 AA62268B AI185131 AW057635 
AA101539AA627936 H272Q2 AI536847 W93084 A/973148 AI246788 AW572108 AM69414 AA454835 AA0127O7 AA430746 
AI084991 AA010400 AA856636 AA463928 AI248310 R07170 AA834033 D12244 AI855670 AA054350 AA639480 AI702067 
AJ475389 

330527 14658.-15 S77356 

330833 103550.1 AA046804 AA046821 

330855 111881.1 AA070316AA079318 

332099 genbanUA608983 M608983 

332240 genbanLN54803 N54803 
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TABLE 1-20B 

Table 1-20B, shows the genomic positioning for those pkeys lacking unigene ID's and 
accession numbers in Tables 1-20. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

Pkey: Unique number corresponding to an Eos probeset 

Ret Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham 1. et a). a refers to the publication 

entitled The DNA 

sequence of human chromosome 22 " Dunham i. et al. r Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which axons were predicted. 

NLposffion: Indicates nucleotide positions of predicted axons. 



Pkey 


Ref Strand 


NLposition 


332792 


Dunham, I, etai. Plus 


73381-73768 


332843 


Dunham, I. etaJ. Pius 


1142859-1143494 


332909 


Dunham, I. eta!. Plus 


1946582-1946735 


332920 


Dunham, I. etai. Plus 


2007562-2007785 


332947 


Dunham, 1. etaL Plus 


2431726-2432006 


332949 


Dunham, 1. etai. Plus 


2436245-2436348 


332958 


Dunham, 1. etai. Plus 


2516164-2516310 


332992 


Dunham, L etai. Plus 


2699997-2701093 


332993 


Dunham, 1. etai Plus 


2701550-2701685 


333004 


Dunham, 1. etai Pius 


2759056-2759165 


333006 - 


Dunham, i. etai. Plus 


2762853-2762953 


333007 


Dunham, 1. etaL Plus 


2763569-2763709 


333132 


Dunham, 1. etaL Plus 


33580404358153 


333133 


Dunham, 1. etai Plus 


3360058-3360195 


333139 


Dunham, 1. etaL Plus 


3369495-3369571 


333152 


Dunham, 1. etai. Plus 


3612171-3612354 


333205 


Dunham, I. etai. Pius 


3942727-3943009 


333221 


Dunham, 1. etai. Pius 


3978070-3978187 


333225 


Dunham, 1. etai Pius 


399222949923B6 


333245 


Dunham, i. etai Pius 


41575874157668 


333246 


Dunham, 1. etai. Plus 


4162041-4162139 


333261 


Dunham, 1. etai. Pius 


43365974337752 


333272 


Dunham, L etai Phis 


43815614382212 


333281 


Dunham, 1. etai , Plus 


45062304506342 


333283 


Dunham, 1. etai Plus 


45142264514360 


333286 


Dunham, 1. etai Plus 


45168414516939 


333298 


Dunham J. etaL Plus 


45815374581947 


333306 


Dunham, 1. etai. Plus 


53962334396310 


333382 


Dunham, 1. etaL Phis 


49057964905913 


333403 


Dunham, 1. etai Plus 


49251404925256 


333420 


Dunham, Letai Plus 


49543024954465 


333428 


Dunham, 1. etai. Pius 


49738694974007 


333464 


• Dunham, 1. etaL Plus 


52107624211300 


333465 


Dunham, 1. etaL Plus 


52113854211858 


333486 


Dunham, 1. eta). Plus 


53962334396310 


333515 


Dunham, 1. etai. Pius 


55642994564851 


333520 


Dunham,!. etai Pius 


558613345B6296 


333566 


Dunham, total Plus 


59542264954473 


333567 


Dunham, Letal. Plus 


59591394959515 


333571 


Dunham, Letal. Plus 


60079164008058 


333572 


Dunham, 1. etai Plus 


60268964027189 


333576 


Dunham, Letai Plus 


60903454090721 


333577 


Dunham, 1. etai. Plus 


61239504124281 


333580 


Dunham, 1. etai. Phis 


61429354143145 


333587 


Dunham, 1. etai Pius 


62505994250966 


333588 


Dunham, 1. etai Plus 


62554454255779 


333591 


Dunham, 1. etai. Pius 


62858844286251 


333592 


Dunham, 1. etai. Plus 


62977314297976 


333593 


Dunham, 1. etai Plus 


63041324304428 


333594 


Dunham, 1. etai Pius 


63089904309450 


333599 


Dunham, 1. etai. Plus 


63378854338255 


333600 


Dunham, 1. etaL Plus 


63556294355925 


333601 


Dunham, 1. etai Plus 


63600754360442 


333607 


Dunham, 1. etai Pius 


65044314504690 


333608 


Dunham, 1. etai. Pius 


65108344511130 
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333619 


Dunham, I. etaL Plus 


333623 


Dunham, I. etal. Plus 


333625 


Dunham, I. etaL Plus 


333626 


Dunham, I. eta). Plus 


333627 


Dunham, I. etaL Plus 


333628 


Dunham,!. etal Plus 


333629 


Dunham, l. etal. Plus 


333631 


Dunham, I. etaL Pius 


333632 


Dunham, I. etal Plus 


333635 


Dunham, I. etal. Plus 


333637 


Dunham, I. etaL Plus 


333640 


Dunham, I. etaL Plus 


333642 


Dunham, I. etal. Phis 


333643 


Dunham, 1. etaL Plus 


333646 


Dunham, 1. etaL Plus 


333647 


Dunham, 1. eta). Plus 


333648 


Dunham, 1. etaL Plus 


333650 


Dunham, 1. etal Pius 


333652 


Dunham, 1. etal. Plus 


333653 


Dunham, 1. etaL Plus 


333654 


Dunham,!. etaL Pius 


333656 


Dunham, 1. eta!. Plus 


333657 


Dunham, I. etal. Plus 


333658 


Dunham, 1. etaL Plus 


333668 


Dunham, 1. etal. Plus 


333670 


Dunham, I. etal. Plus 


333680 


Dunham, 1. etal. Plus 


333682 


Dunham, 1. etaL Plus 


333698 


Dunham, 1. etaL Phis 


333710 


Dunham, 1. etal. Plus 


333717 


Dunham, 1. etaL Plus 


333727 


Dunham,!. etaL Plus 


333785 


Dunham, 1. etal. Phis 


333791 


Dunham, I. etaL Plus 


333859 


Dunham,!. etaL Phis 


333875 


Dunham,!. etaL Plus 


333879 


Dunham,!. etal. Plus 


333891 


Dunham,!. etaL Plus 


333918 


Dunham, 1. etaL Phis 


333929 


Dunham,!. etal. Plus 


333932 


Dunham, 1. etal Plus 


333983 


Dunham, 1. etaL Plus 


333987 


Dunham, 1. eta!. Phis 


333997 


Dunham, I. etaL Plus 


334010 


Dunham,!. etaL Plus 


334015 


Dunham, I. etaL Plus 


334017 


Dunham, 1. eta). Plus 


334026 


Dunham,!. etal Plus 


334030 


Dunham,!. etaL Pius 


334044 


Dunham, 1. etaL Plus 


334047 


Dunham,!. etaL Plus 


334055 


Dunham,!. etaL Phis 


334063 


Dunham, 1. etal. Phis 


334066 


Dunham, 1. etaL Plus 


334068 


Dunham,!. etaL Plus 


334076 


Dunham,!. etal Plus 


334091 


Dunham, 1. etal. Pius 


334106 


Dunham,!. etal. Plus 


334109 


Dunham,!. etaL Pius 


334111 


Dunham, 1. etal. Plus 


334115 


Dunham, LetaL Plus 


334118 


Dunham,!. etaL Plus 


334120 


Dunham, 1. etaL Phis 


334135 


Dunham,!. etal. Plus 


334235 


Dunham, LetaL Plus 


334239 


Dunham, LetaL Phis 


334244 


Dunham, 1. etal. Phis 


334257 


Dunham, I. etaL Plus 


334260 


Dunham, 1. etaL Pius 


334298 


Dunham,!. etal. Plus 


334342 


Dunham, LetaL Phis 


334354 


Dunham, 1. etaL Pius 


334430 


Dunham, 1. etal. Plus 


334435 


Dunham, 1. etaL Plus 


334451 


Dunham, 1. etaL Plus 


334504 


Dunham, LetaL Plus 



6562799-6562926 

65845594584956 

6603020^603310 

66141744614467 

66205844620903 

66290044629233 

66369154637205 

66509044651011 

66515204651658 

66636834663973 

66749684675134 

66883504688624 

670B7604709139 

67280534728343 

6739110 4739379 

67725024772779 

67874654787782 

67968524797128 

68094554809573 

68111304811392 

68167314816993 

68220874822406 

68313694831445 

68352824835474 

7011009-7011223 

7027945-7028181 

7071730-7071794 

7076641-7076760 

7205279-7205383 

72303147230476 

7308714-7308815 

7373219-7373311 

7775317-7775415 

7795972-7796082 

80412034041359 

81355054136179 

81469194147062 

81564374156709 

83071244307215 

847948^8479550 

84891244489205 

88135934813668 

88242454824376 

88666688867255 

89966964998236 

90554524055595 

91395164139634 

919654*4196681 

92884634288782 

9373898-9374065 

94281524428211 

96620774662270 

97319914732085 

97395684739680 

97462794746477 

98016134801693 

98723274872527 

10261155-10261841 

10267679-10267854 

10279365-10279531 

10316414-10316608 

10344273-10344384 

10402389-10403196 

10457085-10457183 

12983601-12983703 

13056569-13056693 

13159198-13159302 

13213243-13213429 

13223819-13223968 

13424763-13425914 

13646844-13646980 

13702598-13702747 

14269664-14270102 

14275597-14275689 

14315572-14315741 

14510206-14510398 
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334510 


Dunham, I. etal. 


334518 


Dunham, 1. etal. 


334525 


Dunham, 1. etal. 


334526 


Dunham, 1. eta!. 


334529 


Dunham, l.etai. 


334565 


Dunham, 1. etal 


334588 


Dunham, 1. eta). 


334590 


Dunham,!, etal. 


334612 


Dunham, I. etal 


334626 


Dunham, l.etai. 


334661 


Dunham, 1. eta). 


334664 


Dunham, I etal 


334666 


Dunham, l.etai 


334676 


Dunham, I. etal. 


334677 


Dunham, l.etai. 


334719 


Dunham, l.etai 


334730 


Dunham, 1. etal. 


334797 


Dunham, 1. etal. 


334819 


Dunham, tetal 


334855 


Dunham, 1. etal. 


334900 


Dunham, 1. etal. 


334906 


Dunham, l.etai 


334907 


Dunham, tetal 


334914 


Dunham, l.etai. 


334915 


Dunham, 1. etal 


334935 


Dunham, 1. etal 


335019 


Dunham, 1. etal. 


335036 


Dunham, 1. etal 


335049 


Dunham, l.etai 


335081 


Dunham, 1. etal 


335131 


Dunham, I etal. 


335157 


Dunham, l.etai 


335163 


Dunham, i. etal. 


335174 


Dunham, 1. etal. 


335180 


Dunham, 1. etal. 


335188 


Dunham, 1. etal 


335189 


Dunham, 1. eta). 


335193 


Dunham, I eta). 


335200 


Dunham, I etal 


335201 


Dunham, 1. etal 


335284 


Dunham, 1. eta). 


335311 


Dunham, I etal. 


335320 


Dunham, I etal 


335371 


Dunham, 1. etal. 


335377 


Dunham, 1. eta). 


335421 


Dunham, L etal 


335448 


Dunham, l.etai. 


335451 


Dunham, 1. etal. 


335455 


Dunham, I etal. 


335468 


Dunham, l.etai 


335470 


Dunham, 1. etal. 


335482 


Dunham, I eta). 


335485 


Dunham, 1. etal 


335517 


Dunham, 1. etal. 


335536 


Dunham, 1. etal 


335565 


Dunham, tetal 


335585 


Dunham,!, etal. 


335587 


Dunham, 1. etal 


335593 


Dunham, I etal 


335634 


Dunham,!, etal. 


335651 


Dunham, 1. etal. 


335652 


Dunham, 1. etal 


335667 


Dunham, 1. etal 


335687 


Dunham,), etal. 


335690 


Dunham, I etal 


335692 


Dunham, 1. etal 


335693 


Dunham, I etal 


335700 


Dunham, 1. etal. 


335720 


Dunham, 1. eta!. 


335721 


Dunham, 1. etal 


335733 


Dunham, 1. etal. 


335739 


Dunham, 1. etal. 


335745 


Dunham, 1. etal. 


335747 


Dunham, t. etal. 


335783 


Dunham, 1. etal. 


335968 


Dunham, 1. etal. 



Plus 


14522303-14522418 


Plus 


14630584-14630669 


Plus 


14781066-14761137 


Plus 


14787558-14787675 


Plus 


14788825-14788971 


Plus 


14989033-14989353 


Plus 


14992698-14993210 


Plus 


15033247-15033753 


Plus 


15170470-15170535 


Plus 


15299954-15300077 


Plus 


15477716-15477786 


Plus 


15501941-15502119 


Plus 


15504203-15504279 


Plus 


15516334-15516412 


Plus 


15517449-15517560 


Plus 


15778859-15779026 


Plus 


15967830-15967934 


Plus 


16383231-16383458 


Plus 


16758514-16758711 


Plus 


18402556-18402735 


Plus 


19315678-19315743 


Plus 


19323493-19323590 


Plus 


19336829-19336938 


Plus 


19495158-19495275 


Plus 


19505267-19506101 


.Plus 


20108247-20108373 


Plus 


20689525-20689582 


Plus 


20755250-20756375 


Plus 


20877184-20877309 


Plus 


21113871-21113937 


Plus 


21449834-21450003 


Plus 


21543302-21544341 


Plus 


21583508-21583655 


Plus 


21631301-21631447 


Plus 


21641226-21641603 


Plus 


21669118-21669328 


Pius 


21673403-21673472 


Plus 


21692208-21692362 


Plus 


21743499-21743881 


Plus 


21745249-21745664 


Plus 


22272583-22272712 


Plus 


22501602-22501676 


Plus 


22542132-22542246 


Pius 


22849529-22849808 


Plus 


2286990^-22870188 


Plus 


23250433-23250498 


Plus 


23477532-23477831 


Plus 


2348061143480711 


Plus 


23486127-23486216 


Plus 


23787245-23787367 


Plus 


23812877-23813365 


Plus 


24087956-24088114 


Plus 


24098953-24099123 


Plus 


24226461-24226531 


Plus 


2449878344499762 


Phis 


24881886-24882470 


Plus 


24988240-24988405 


Plus 


24991457-24991616 


Pius 


25037253-25037370 


Plus 


25163837-25164042 


Plus 


25317560-25317696 


Plus 


25325625-25325883 


Plus 


25345735-25345856 


Plus 


25445952-25446064 


Plus 


25455442-25455625 


Pius 


25468557-25468725 


Plus 


25471363-25471524 


Plus 


25512222-25512890 


Plus 


25596542-25596646 


Plus 


25599118-25599226 


Plus 


25686486-25686626 


Plus 


25698550-25698826 


Pius 


25718164-25718332 


Plus 


25723304-25723494 


Plus 


25941882-25942040 


Plus 


27743843-27744029 
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335971 


Dunham, Letal. 


335975 


Dunham, I. etaL 


335993 


Dunham, Letal. 


335998 


Dunham, I. elal. 


336011 


Dunham, Letal. 


336035 


Dunham, Letal 


336049 


Dunham, t. etai 


336055 


Dunham, 1. etal. 


336072 


Dunham, Letal 


336075 


Dunham, Letal 


336061 


Dunham, Letal. 


336083 


Dunham, Letal. 


336084 


Dunham, Letal. 


336086 


Dunham, 1. etai 


336087 


Dunham, 1. etal. 


336088 


Dunham, Letal. 


336089 


Dunham, Letal. 


336090 


Dunham, Letal. 


336091 


Dunham, Letal. 


336092 


Dunham, Letal 


336094 


Dunham, Letal. 


336102 


Dunham, Lata). 


336124 


Dunham, Letal. 


336125 


Dunham, 1. etai. 


336127 


Dunham, 1. etai. 


336128 


Dunham, Letal. 


336171 


Dunham, Letal. 


336214 


Dunham, Letal 


336306 


Dunham, Letal 


3363.16 


Dunham, Letal. 


336333 


Dunham, Letal. 


336367 


Dunham, Letal. 


336442 


Dunham, Letal 


336481 


Dunham, 1. etaL 


336489 


Dunham, Letal. 


336498 


Dunham, L etal. 


336502 


Dunham, Letal 


336516 


Dunham, 1. etal 


336548 


Dunham, L etal. 


336590 


Dunham, 1. etal. 


336593 


Dunham, Letal. 


336606 


Dunham, Letal 


336623 


Dunham, 1. etal 


336632 


Dunham, L etal. 


336633 


Dunham, Letal.. 


336634 


Dunham, 1. etal. 


336635 


Dunham, Letal 


336637 


Dunham, 1. etaL 


336638 


Dunham, Letal 


336654 


Dunham, 1. etal. 


336665 


Dunham, 1. etal 


336695 


Dunham,!, etal 


336705 


Dunham, 1. eta). 


336751 


Dunham, 1. etal 


336753 


Dunham, Letal. 


336757 


Dunham, Letal 


33S759 


Dunham,!. etaL 


336808 


Dunham, 1. etal 


336813 


Dunham,!, eta). 


336827 


Dunham, Letal 


336833 


Dunham, Letal 


336836 


Dunham, 1. etal. 


336878 


Dunham, Letal 


336898 


Dunham, letal 




Dunham, 1. etal 


336924 


Dunham, Letal. 


336934 


Dunham, i. etal 


336958 


Dunham, L etal 


336968 


Dunham, 1. etal 


336985 


Dunham,!, etal. 


337032 


Dunham, 1. etal. 


337121 


Dunham, Letal 


337132 


Dunham, Letal. 


337168 


Dunham, Letal 


337170 


Dunham, 1. etal 


337172 


Dunham, Letal 



Plus 


27752808-27753017 


Plus 


27801321-27801391 


Phis 


28006396-28006571 


Plus 


28026247-28026356 


Plus 


28570760-28570900 


Phis 


29016748-29017410 


Plus 


29124451-29124606 


Plus 


29166352-29166573 


Plus 


29269409-29269575 


Plus 


29343938-29344050 


Plus 


29372270-29372407 


Plus 


29379738-29379881 


Plus 


29380664-29380792 


Phis 


29385594-29385713 


Plus 


29387122-29387253 


Plus 


29389001-29389124 


Plus 


29401421-29401625 


Plus 


29413020-29413162 


Phis 


29415486-29415617 


Plus 


29438816-29438948 


Plus 


29567934-29568099 


Plus 


. 29681500-29681687 


Plus 


30053441-30053500 


Phis 


30056541-30056622 


Plus 


30058870-30059029 


Plus 


30059294-30059367 


Plus 


30223877-30224048 


Plus 


3056198840562044 


Plus 


33172949-33173281 


Plus 


3333877643338919 


Plus 


33673947-33677345 


Plus 


3401393944014124 


Phis 


34187937-34188061 


Plus 


34231611-34231720 


Plus 


34239511-34239619 


Plus 


3426734044267424 


Plus 


3426895344269083 


Plus 


3429606844296249 


Plus 


3435388144354826 


Plus 


2569428-2570275 


Plus 


4617967-4618783 


Plus 


16170215-16170415 


Plus 


167287-167616 


Plus 


983890-985529 


Plus 


985591-986221 


Ptus 


986296-986670 


Ptus 


987908-988364 


Plus 


989276-890813 


Plus 


991906-993240 


Plus 


1588782-1588858 


Plus 


2007241-2007363 


Plus 


246460^-2464739 


Plus 


2817668-2817723 


Plus 


44996294499739 


Plus 


4509509-4509601 


Phis 


4541167-4541466 


Plus 


4560526-4560699 


Plus 


6251639-6251761 


Plus 


6436416^436719 


Plus 


6773762-6774057 


Plus 


6856506-6856634 


Plus 


70772627077326 


Plus 


9200300^200399 


Plus 


10138365-10138458 


Plus 


11228329-11228403 


Ptus 


11525273-11525527 


Plus 


11854506-11854792 


Plus 


13203550-13203973 


Plus 


13660250-13660530 


Plus 


14785058-14785225 


Plus 


16949229-16949306 


Plus 


22)50289-22050370 


Ptus 


22255234-22255437 


Plus 


23503596-23504240 


Plus 


23533763-23534008 


Plus 


23608358-23608526 
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OOI £10 


Dunham, I. still. 


337219 


uunnam, i. eufli. 


337236 


Dunham, 1, etal. 


007040 


Dunham, 1. etaL 


337259 


uunnam, i. etai. 


00*7001 


uunnam, i, etai. 


337298 


uunnam, I. etaL 


33MU4 


Dunham. 1. etal. 


337333 


Dunham, 1. etal. 


337338 


uunnam, 1. etal. 


4470/C 

337345 


|V_L B _. | _4 _l 

Dunnam, I. etal. 


337355 


Dunham, I, etal 


337363 


Dunham, 1. etal 


337396 


Dunnam, l. etal. 


007/AO 

•537402 


Dunham, i. eta). 


007/AO 

337403 


Dunham, 1. etaL 


447/ OS 

3374/0 


Dunham, 1. etaL 


447/ OC 

337426 


Dunnam, 1. etal. 


337427 


Dunham, I. etal. 


337435 


p*_. l_ | x __i 

Dunham, 1. etaL 


337484 


Dunham, 1. etaL 


337469 


Dunham, I.etaL 


447EAO 

337503 


Dunham, l. etal. 


337504 


Dunham, 1. etal. 


337522 


Dunham, 1. etaL 


447C// 

337544 


Dunham, 1. etaL 


337548 


Dunham, 1. etal. 


337562 


Dunham, 1. etal 


337564 


Dunham, 1. etal. 


AA^f-AA 

337586 


Dunham, J. etaL 


337628 


Dunham, 1. etaL 


337657 


Dunham, I.etaL 


337685 


Dunham, 1. eta!. 


337695 


Dunham, 1. etal. 


337697 


Dunham, 1. etal. 


337756 


Dunham, 1. etaL 


337781 


Dunham, 1. etaL 


337797 


Dunham, 1. etal. 


337835 


Dunham, 1. etal. 


337643 


Dunham, I.etaL 


337865 


Dunnam, 1. etaL 


337870 


Dunham, 1. etaL 


337304 


Dunham, 1. etal. 


337918 


Dunham, L etaL 


337930 


Dunham, i. etaL 


337947 


Dunham, 1. etaL 


337951 


Dunham, 1. etaL 


337954 


Dunham, L etal. 


337964 


Dunham, L etaL 


338043 


Dunham, 1. etaL 


338054 


Dunham, 1. etaL 


338082 


Dunham, 1. etal. 


338080 


Dunham, 1. etal 


338090 


Dunham, 1. etal. 


AAA And 

338096 


Dunham, 1. etaL 


338120 


Dunham, L etaL 


AAA J AA 

338123 


Dunham, L etal. 


338124 


Dunham, 1. etal. 


338177 


Dunham, 1. etaL 


338239 


Dunham, L etaL 


338276 


Dunham, 1. etaL 


AAA AAA 

338280 


Dunham, 1. etaL 


338326 


Dunham, L etal. 


AAAAA_> 

338336 


Dunham, 1. etal. 


44.A!Ud 


Dunham 1 a4 al 


338411 


Dunham, I.etaL 


338412 


Dunham, 1. etaL 


338491 


Dunham, I.etaL 


338514 


Dunham, 1. etal. 


338516 


Dunham,!, etal. 


338521 


Dunham, t. etal 


338522 


Dunham, 1. etal. 


338526 


Dunham, 1. etal. 


338527 


Dunham, L etal. 


338529 


Dunham, 1. etal. 


338579 


Dunham, 1. etal. 



nUS 


Oft 1 9ftAfifi_9fi i Oficcc 
_0l_040O-_0 1 ZOOuO 


Plus 


oc/oocoa ojmoooco 


□hie 

rlUS 


074 4J14.4Q 0744/074 


□lite 

rlUS 


074/4/0C 074/4770. 


Dtirc 

rlUS 


/707 WOrli 07414*: 


Plus 


OQOOOOflO 90000/4 0 


Plus 


ZSu02w9-«QU480o 


Plus 


00444004 00444044 

29131691-Z91318Z3 


Plus 


on— o 4000 40004674 

302fi13o3-302olD73 


RUS 


40/O7CA0 iryATTTtC 

30427592-30427775 


Plus 


one /no 07 000/0/07 

30o493o7-3u649467 


Plus 


4O04O040 0004404 0 

3Uo32B3fl-3QB33D42 


Plus 


30942712-30942840 


Plus 


31585902-31586067 


Plus 


440A444/ 44MiiAA 

31691 134-31 6914B2 


Plus 


44PA4440 AiMn^4< 

31692319-31692611 


Plus 


32072241-32072334 


Plus 


32076190-32076349 


Plus 


32078958-32079076 


Plus 


32255715-32255844 


Plus 


A4nAOA<M MA4A47A 

33238033-33238176 


Plus 


AAAAf J AAAAf ATlt 

33295724-33295872 


Plus 


33385583-33385857 


Plus 


444000C4 4440OO4O 

333oDfl03^33ffy?ff 


Plus 


AAAM4BA MAAAAm 

33963168-33963979 


Plus 


34268412-34268495 


Pius 


4//7O0OO 4/JITOOOT 

34472882-34472957 


Plus 


AWMAC ATMAi 

278105-278281 


Plus 


285643-285788 


Plus 


952102-952250 


Plus 


2013838-2014032 


Plus 


A0140C4 ooier/vce 

2844851-2845055 


Plus 


4EJT4£4 OC17AJC 

3547151-3547245 


Plus 


3609o70-3609808 


Plus 


APA1/TTJ A A if JAP 

3544971-3645195 


Plus 


3975015-3975172 


Plus 


/4400/0 /44040C 

4116049-4116265 


Plus 


4325712-4325811 


Plus 


47922994792423 


Plus 


JAAJA0O J Art PA A A 

4984966-4985082 


Plus 


5377705-5377861 


Plus 


5442516-5442636 


Plus 


p^Apo 4 f\ r A nAAiA 

5685819-5686012 


Plus 


6031241-6031358 


Plus 


AAAAPAA A AAAAA-A 

6393566-6393692 


Plus 


aati-a i A AA-tnni 

6675643-6875781 


Plus 


6766321-6766382 


Plus 


/-> nA/ J r> A AAA J AAA 

6831483-6831620 


Plus 


1AAA7AA «f AAA AAA 

70327207032802 


Plus 


8222138-8222430 


Plus 


8429258-8429382 


Plus 


8578852-8579012 


Plus 


AAiAii A AAiAAflA 

9212114-9212220 


Plus 


9488619-9488858 


Plus 


9698397-9698469 


Plus 


10765673-10765820 


Plus 


10815824-10815886 


Plus 


A AAAAAi A A AAAA A^A 

10860311-10860471 


Plus 


A A*^ A 4 fAft iA^niYAi 

12781530-12781724 


Plus 


14669918-14670016 


Plus 


_fA<IAA*»JlJ 4l A _f AAA J A 

16168714-16168840 


Plus 


16171884-16172039 


Plus 


A «VA AAAAA i^APAAAi 

17358806-17358964 


Plus 


17453324-17453448 


Plus 


17780512-17780623 


Plus 


19299273-19299522 


Pius 


19340661-19340737 


Plus 


21168430-21168556 


Plus 


21379420-21379655 


Pius 


21382180-21382383 


Pius 


21471358-21471476 


Phis 


21471617-21471711 


Plus 


21551136-21551250 


Plus 


21552237-21553280 


Plus 


21635616-21635867 


Pius 


22774968-22775132 
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338596 


Dunham, I, eta 


338648 » 


Dunham, 1. eial. 


338679 


Dunham, l.etal. 


338703 


Dunham, l.etal 


338705 


Dunham, l. etal. 


338747 


uunnam, i. eiau 


338770 


Dunham, 1. etal 


338809 


Dunham, l.etal 


338830 


Dunham, tela). 


338880 


Dunham, l.etal. 


338895 


Dunham, l.etal. 


338896 


Dunham,l. etal 


338911 


Dunham, l.etal 


338925 


Dunham, l.etal. 


338952 


Dunham, l.etal. 


338959 


Dunham, L etal. 


338970 


uunnam, I. etai 


339081 


Dunham, l.etal. 


339116 


Dunham, 1. etal. 


339164 


Dunham, 1. etal. 


339246 


Dunham, 1. etal 


339271 


Dunham, 1. etal. 


339304 


Dunham, l.etal. 


339309 


Dunham, l.etal 


339312 


Dunham, l.etal 


339356 


Dunham, l.etal. 


339357 


Dunham, l.etal. 


3393$) 


Dunham, l.etal 


339384 


Dunham, l.etal 


339413 


Dunham, l.etal. 


339419 


Dunham, l.etal. 


332854 


Dunham, l.etal 


332859 


Dunham, l.etal 


332896 


Dunham, l.etal. 


332995 


Dunham, 1. etal. 


333014 


Dunham, l.etal 


333043 


Dunham,), etal 


333054 


Dunham, 1. etal. 


333069 


Dunham, l.etal. 


333074 


Dunham, l.etal. 


333101 


Dunham, l.etal. 


333140 


Dunham, (.etal 


333142 


Dunham, 1. etal. 


333175 


Dunham, l.etal. 


333216 


Dunham, l.etal. 


333219 


Dunham, l.etal. 


333220 


Dunham, 1. etal 


333240 


Dunham, 1. etal. 


333242 


Dunham, i. etal 


333251 


Dunham, 1. etal 


333252 


Dunham, l.etal. 


333257 


Dunham, I etal. 


333258 


Dunham, l.etal. 


333271 


Dunham, I etal 


333294 


Dunham, l.etal 


333296 


Dunham, I etal. 


333312 


Dunham, l.etal. 


333313 


Dunham, I etal 


333323 


Dunham, I etaL 


333329 


Dunham, 1. etal. 


333339 


Dunham, I. etal. 


333346 


Dunham, 1. etaL 


333355 


Dunham, 1. etal 


333408 


Dunham, 1. etal. 




uunnam, i. etai. 


333424 


Dunham, l.etal 


333441 


Dunham, I. etal 


333466 


Dunham, l.etal. 


333495 


Dunham, 1. etal. 


333496 


Dunham, 1. etaL 


333541 


Dunham, l.etal. 


333545 


Dunham, 1. etal. 


333561 


Dunham, l.etal. 


333568 


Dunham, l.etal. 


333581 


Dunham, l.etal. 


333582 


Dunham,!, etal 



Plus 


2307B273-2307o34o 


Plus 


24091 259-24091434 


Plus 


246o4o3&>246o4940 


Plus 


25219ooi-2o22QU4o 


Plus 


25231397-25231551 


Plus 


26336372-26336460 


Plus 


26784513-2o7o4714 


Plus 


27271920-27272000 


Phis 


27759923-27760055 


Plus 


28420713-28420626 


Plus 


28598893-28599135 


Plus 


28618487-28618552 


Plus 


28804184-28804445 


Pius 


28883892-28884036 


Plus 


29418831-29418968 


Plus 


29478431-29478589 


Plus 


28645460-2964561 3 


Plus 


30927871-30928080 


Plus 


31462304-31462382 


Plus 


32207441-32207802 


Plus 


440JJC40 O'HJJJJTfltf 

3284451o-3x844586 


Phis 


32999935-33000078 


Plus 


lOAnA^i O AOAAAAA4 

33323712-33323821 


Plus 


1AAAAAAA AAA/mJAA 

3336690033367198 


Phis 


33378992-33379172 


Phis 


33573387-33573517 


Pius 


33575210-33575416 


Plus 


33579433-33579593 


Plus 


33950591-33950697 


Plus 


34268734-34268675 


P)us 


34359680-34359777 


Mnus 


1283611-1283053 


Minus 


1358943-1358861 


Minus 


1631641-1631422 


Minus 


AVAAAJf A^AftAhr 

2708847-2708685 


l\tes 


2772479-2772387 


Minus 


2920906-2920781 


Minus 


2995314-2995219 


Minus 


3040771-3040661 


Minus 


3054171-3054008 


Minus 


3208966-3208908 


Minus 


AA99AAA OA9/WIAA 

33772203376309 


Minus 


3474264-3474145 


Minus 


3768113-3768051 


Minus 


3967627-3967512 


A/Enus 


3968531-3968423 


Minus 


AABAA0A AA/^A^AA 

39593633968789 


Minus 


JiAAiAf iiAAAAi 

4100495-4100281 


Minus 


4104544-4104259 


Minus 


4168970-4168870 


Minus 


4172550-4172390 


Minus 


43007244300578 


Minus 


4301348-4301269 


Minus 


4377786-4377589 


Minus 


4538726-4538642 


Minus 


4550766-4550644 


Minus 


4638794-4638635 


Minus 


4639397-4639277 


Minus 


4655333-4655166 


Minus 


46617794661672 


Minus 


4679691-4679610 


Minus 


4704564-4704466 


Minus 


47275554727460 


Minus 
Minus 


49368794936661 
4958233-4957593 


Minus 


4959694-4958713 


Minus 


2708847-2708685 


Minus 


2708847-2708685 


Minus 


5404039-5403881 


Minus 


5404643-5404523 


Minus 


5859241^5859098 


Minus 


58855395885064 


Minus 


59036595903590 


Minus 


5965072-5964999 


Minus 
Minus 


6151638-6151159 
6158522-6158322 
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333665 


Dunham, I. etal Minus 


333713 


Dunham, I. etal. Minus 


333730 


Dunham, I. etal. Minus 


333737 


Dunham, I. etal. Minus 


333762 


Dunham, I. etal. Minus 


333615 


Dunham, I. etal Minus 


333895 


Dunham, I. etal. Minus 


333963 


Dunham, I. etal. Minus 


333910 


Dunham,!. etal Minus 


333944 


Dunham, I. etal. Minus 


333965 


Dunham, I etal. Minus 


334035 


Dunham, I. etal Minus 


334125 


Dunham, 1. etal. Minus 


334137 


Dunham, 1. etal Minus 


334172 


Dunham, 1. etal. Minus 


334175 


Dunham, 1. etal. Minus 


334191 


Dunham, 1. eta). A/finus 


334196 


Dunham, 1. eta). Minus 


334198 


Dunham, 1. etal. Minus 


334215 


Dunham, 1. etal. Minus 


334237 


Dunham, 1. etal Minus 


334254 


Dunham, 1. etal. Minus 


334304 


Dunham, I. etal. Affirms 


33435B 


Dunham, 1. etal. Minus 


334396 


Dunham, 1. etal. Minus 


334399 


Dunham, 1. etal. Minus 


334418 


Dunham, 1. etal Minus 


334470 


Dunham, 1. etal. Minus 


334474 


Dunham, I. etal. Minus 


334476 


Dunham, 1. etal Minus 


334496 


Dunham, 1. etal. Minus 


334540 


Dunham, 1. etal Minus 


334548 


Dunham, f. etal Minus 


334560 


Dunham, 1. eta). Minus 


334591 


Dunham, 1. etal Minus 


334593 


Dunham, 1. etal Minus 


334625 


Dunham, 1. etal Minus 


334650 


Dunham,!. etal Minus 


334662 


Dunham, 1. etal Minus 


334690 


Dunham, 1. etal. Minus 


334691 


Dunham, I. etal Minus 


334745 


Dunham,/. etal Minus 


334752 


Dunham, 1. eta). Mnus 


334759 


Dunham, 1. etal Minus 


334760 


Dunham, 1. etal Minus 


334764 


Dunham, 1. etal Minus 


334766 


Dunham, 1. etal Minus 


334779 


Dunham,!. etal Minus 


334782 


Dunham, 1. etal. Minus 


334783 


Dunham, 1. etal Minus 


334784 


Dunham, 1. etal Minus 


334785 


Dunham, 1. etal. Minus 


334788 


Dunham,!. etal Minus 


334793 


Dunham, 1. etal Minus 


334804 


Dunham, 1. etal Minus 


334823 


Dunham, I. etal. Minus 


334827 


Dunham, 1. etal Minus 


334836 


Dunham, 1. etal Minus 


334842 


Dunham, 1. etal. Minus 


334946 


Dunham, 1. etal Minus 


334948 


Dunham, 1. etal. Minus 


334955 


Dunham, 1. etal Minus 


334958 


Dunham, 1. etal Minus 


334966 


Dunham,!. etal. Minus 


334969 


Dunham, 1. etal. Minus 


334970 


Dunham, I. etal Minus 


335000 . 


Dunham, 1. etal. Minus 


335002 


Dunham, 1. etal Minus 


335009 


Dunham, 1. etaL Minus 


335068 


Dunham, 1. etal. Minus 


335090 


Dunham, I. eta). Minus 


335092 


Dunham, 1. etal Minus 


335108 


Dunham, 1. etal Minus 


335115 


Dunham, ). etal Minus 


335116 


Dunham, 1. etal Minus 


335217 


Dunham, 1. etal. Minus 



6975471-6975215 

7289490-7289337 

7489326-7489007 

7551981-7551842 

7679466-7679343 

7871914-7871764 

819384&8193735 

8216956^216810 

8230221-8230116 

85570514556936 

86274844626258 

93235084323292 

10433521-10432612 

10461063-10460708 

11644142-11644008 

11668659-11668597 

11932215-11932091 

11966322-11966182 

12032364-12032146 

12670395-12670077 

12994394-12993954 

13199268-13198999 

13455315-13455219 

13724372-13724201 

14164482-14164340 

14166269-14186163 

14237235-14237028 

14389581-14389442 

14391920-14391809 

14394777-14394589 

14480743-14480669 

14832365-14832224 

14837304-14837150 

14980648-14980485 

15035270-15035057 

15036374-15036242 

15244936-15243666 

15371251-15371178 

15520806-15520668 

15603055-15602918 

15615920-15611418 

16049960-16049653 

16118059-16117848 

16134427-16134207 

16136658-16136578 

16151208-16151104 

16154663-16154529 

16276881-16276815 

16291693-16291574 

16293336-16293226 

16294548-16294360 

16295583-16295408 

16301237-16301081 

16330748-16330681 

16428342-16428208 

16851360-16851189 

16905877-16905684 

17215581-17215418 

17464352-17464181 

20140514-20140410 

20141727-20141583 

20151753-20151684 

20156046-20155932 

20169171-20169079 

20188176-20188020 

20195886-20195554 

20455403-20455348 

20479143-20479008 

2Q602011-20601951 

20955606-20955458 

21192979-21190943 

21298638-21298571 

21324221-21324030 

21388250-21388146 

21388573-21388414 

21801622-21801195 



211 



WO 02/068677 



PCI7US02/06001 



335234 


Dunham, 1. elaL 


335236 


Dunham, 1. elaL 


335247 


Dunham, 1. etal 


335262 


Dunham, 1. etal. 


335265 


Dunham, 1. elaL 


335266 


Dunham, 1. etaL 


335281 


Dunham, I. etal. 


335331 


Dunham, 1. etal. 


335339 


Dunham, t. etaL 


335340 


Dunham, 1. etaL 


335344 


Dunham, 1. etaL 


335346 


Dunham, 1. etal. 


335349 


Dunham, 1. etal 


335523 


Dunham, 1. etal. 


335527 


Dunham, Letal 


335566 


Dunham, Letal. 


335606 


Dunham, I. etaL 


335750 


Dunham, 1. eta). 


335755 


Dunham, 1. etal. 


335766 


Dunham, I. etaL 


335774 


Dunham, 1. etal. 


335777 


Dunham, L etal. 


335782 


Dunham, 1. etal. 


335787 


Dunham, 1. etaL 


335606 


Dunham, 1. etaL 


335817 


Dunham, 1. etal. 


335827 


Dunham, 1. etal. 


335831 


Dunham, Letal 


335832 


Dunham, 1. etaL 


335863 


Dunham, I. etal 


335895 


Dunham, 1. etal. 


335902 


Dunham, Letal 


335920 


Dunham, 1. etaL 


335956 


Dunham, 1. etal. 


335980 


Dunham, 1. etal. 


335999 


Dunham, Letal 


336000 


Dunham, Letal. 


336029 


Dunham, 1. etal 


336142 


Dunham, 1. etal. 


336149 


Dunham, 1. eta). 


336161 


Dunham, 1. etaL 


336177 


Dunham, 1. etaL 


336198 


Dunham, 1. etal. 


338199 


Dunham, L etal 


336200 


Dunham, 1. etaL 


336202 


Dunham, I etal. 


336203 


Dunham, Letal. 


336227 


Dunham, Letal 


336231 


Dunham, 1. etal 


336232 


Dunham, 1. etal. 


336238 


Dunham, L etaL 


336243 


Dunham, 1. etaL 


336246 


Dunham, Letal 


336279 


Dunham, 1. eta). 


336280 


Dunham, 1. eta). 


336294 


Dunham, 1. etal. 


336295 


Dunham, L etaL 


336328 


Dunham, Letal. 


336347 


Dunham,!. etaL 


336351 


• Dunham, I. etal. 


336397 


Dunham, 1. etal 


336400 


Dunham, 1. etal. 


336402 


Dunham, L eta). 


336406 


Dunham, 1. etal 


336417 


Dunham,!, etal 


336455 


Dunham, L etal. 


336463 


Dunham,!, etal 


336479 


Dunham, 1. etal. 


336510 


Dunham, 1. etal. 


336538 


Dunham, 1. eta). 


336584 


Dunham,!, etal. 


336599 


Dunham, L etal. 


336615 


Dunham, L etaL 


336616 


Dunham, 1. etaL 


336618 


Dunham, Letal 


336645 


Dunham, 1. etal. 



Minus 


AiAAiAA^ ru AA M AAA 

21904287-21904080 


Minus 


21915016-21914870 


Minus 


21944287-21944209 


Minus 


22065821-22065630 


Minus 


22102127-22102000 


Minus 


22108492-22108411 


Minus 


22237614-22237503 


Minus 


22556823-22556708 


Minus 


22569208-22569045 


Minus 


22570500-22570342 


Minus 


22600566-22600498 


Minus 


22646377-22646122 


Minus 


22661861-22661271 


Minus 


24235282-24235175 


Minus 


24241519-24241370 


Minus 


24892522-24891998 


Minus 


25056719-25055010 


Minus 


25732501-25731972 


Minus 


25763806-25763747 


Minus 


25819821-25819690 


Minus 


25883733-25883572 


Minus 


25885770-25885599 


Minus 


25908578-25908440 


Minus 


25947022-25946B35 


Minus 


26217475-26217146 


Minus 


26321875-26321750 


Minus 


26380557-26380472 


Minus 


26321526-26321349 


Minus 


26383701-26383583 


Minus 


26693017-26692763 


Minus 


26975307-26975239 


Minus 


26985491-26985358 


Minus 


27034927-27034611 


Minus 


27653729-27653635 


Minus 


27846751-27846614 


Minus 


28033986-28033848 


Minus 


28105547-28105354 


fvfinus 


28732042-28731966 


Minus 


30135157-30135054 


Minus 


3014818540148059 


Minus 


30169370-30169185 


Mnus 


3030663840303284 


Minus 


3045966840459460 


Minus 


3046124540461040 


Minus 


3046297140462895 


Minus 


3046893340468781 


Minus 


3047079640470653 


Minus 


3090201440901946 


fvfinus 


30967656-30967471 


Minus 


3097770740977645 


Minus 


3116008341159959 


Minus 


3140223741402104 


Minus 


3142566941425253 


Minus 


32103203-32102860 


Minus 


3210358842103497 


Minus 


3294699742946890 


Minus 


3295029142950162 


Minus 


3358446143584201 


Minus 


3384321843843104 


Minus 


3386589043865681 


Minus 


3402150444021389 


Minus 


3402343744023298 


Minus 


3402409044023981 


Minus 


3403077444030564 


Minus 


3404777144047675 


Minus 


3420915544209018 


Minus 


3421223644211968 


Minus 


04Z1BZZ4-3421S139 


Minus 


3427704644276928 


Minus 


3432927044329189 


Minus 


3451656644516468 


Minus 


11736500-11734418 


Minus 


26020522-26016546 


Nfinus 


26021027-26020848 


Minus 


25799-25612 


Minus 


1351268-1351168 
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33S653 


Dunham, 1. etal. 


336657 


• Dunham, L etal. 


336717 


Dunham, LetaL 


336772 


Dunham, Letal. 


336795 


Dunham, 1. etal. 


336796 


Dunham, 1. etal 


336854 


Dunham, 1. etal. 


336858 


Dunham, 1. etal. 


336862 


Dunham, 1. etal. 


336863 


Dunham, 1. etal 


336883 


Dunham, 1. etat 


336908 


Dunham, L etal. 


336927 


Dunham, 1. etal. 


336929 


Dunham, i. etal 


336978 


Dunham, 1. etal. 


336981 


Dunham, 1. etal. 


336984 


Dunham, 1, etal 


336994 


Dunham, 1. etal 


336999 


Dunham, 1. etal. 


337011 


Dunham, 1. etat 


337023 


Dunham, 1. etal 


337069 


Dunham, 1. etal 


337092 


Dunham, t etal 


337093 


Dunham, 1. etal 


337094 


Dunham, LetaL 


337097 


Dunham, letal' 


337100 


Dunham, Letal 


337114 


Dunham, 1. etal 


337203 


Dunham, LetaL 


337204 


Dunham, 1. etal. 


337225 


Dunham, L etal 


337244 


Dunham, LetaL 


337279 


Dunham, 1. etat. 


337289 


Dunham, 1. etal. 


337316 


Dunham, 1. etal. 


337431 


Dunham, 1. etal 


337432 


Dunham, 1. eta). 


337445 


Dunham, LetaL 


337451 


Dunham, L etal 


337452 


Dunham, 1. etal 


337455 


Dunham, 1. etal. 


337456 


Dunham, 1. etal. 


337513 


Dunham, 1. etal 


337514 


Dunham, L etal 


337517 


Dunham, L etal 


337590 


Dunham, 1. etal. 


337592 


Dunham, L etal 


337604 


Dunham, LetaL 


337606 


Dunham, 1. etal 


337612 


Dunham, Letal. 


337645 


Dunham, L etal 


337704 


Dunham, 1. etal 


337706 


Dunham, Lelal 


337760 


Dunham, L etal. 


337768 


Dunham, L etal 


337771 


Dunham, 1. etal 


337778 


Dunham, Letal 


337787 


Dunham, 1. etal 


337816 


Dunham, 1. etal 


337821 


Dunham, Letal. 


337848 


Dunham, 1. etal. 


337911 


Dunham, L etal 


337913 


Dunham, 1. etal 


337974 


Dunham, 1. etal. 


337984 


Dunham, 1. etal. 


338069 


Dunham, 1. etal 


338087 


Dunham, Letal. 


338113 


Dunham, 1. etal. 


338132 


Dunham, 1. etal 


338140 


Dunham, 1. etal 


338151 


Dunham, 1. etal. 


338221 


Dunham, Letal 


338223 


Dunham, Letal 


338235 


Dunham, Letal. 


338271 


Dunham, Letal. 


338285 


Dunham, Letal 



Minus 


1568327-1568216 


Minus 


1705497-1705415 


Mfous 


32974174297352 


Minus 


512993M129785 


Minus 


568158W681460 


Minus 


5683253-5683016 


Minus 


7884839-7884767 


Mftus 


8200946-6200789 


Minus 


83945938394369 


Minus 


8396673-8396425 


Minus 


9322767-9322554 


Minus 


10791347-10791256 


Minus 


1154811M1547937 


Minus 


11603040-11602908 


Minus 


14241223-14241030 


Minus 


14478638-14478472 


Minus 


14725450-14725309 


Minus 


15104752-15104656 


Minus 


15389579-15388897 


Minus 


16106423-16106080 


Minus 


16346174-16346065 


Minus 


19014379-19014222 


Minus 


20140132-20139980 


Minus 


20145693-20145570 


Minus 


20146915-20146778 


Minus 


20498648-20497686 


Minus 


20543830-20543751 


Minus 


21328259-21328050 


tonus 


25349286-25349156 


Minus 


25419024-25418931 


Minus 


26607638-26607468 


Minus 


27362048-27361952 


tonus 


28430185-28430033 


Minus 


28756246-28758163 


Minus 


29657129-29656997 


Minus 


3211052942110475 


fta 


3219913442198922 


Minus 


3229238042292228 


Minus 


3240868042408576 


Minus 


324151B742415117 


Minus 


3243451742434425 


Minus 


3243888742438828 


Minus 


3347869243478562 


Minus 


3348440843484339 


Minus 


3379556843795385 


tonus 


982397-982246 


Minus 


1007791-1007634 


Minus 


1329838-1329759 


tonus 


1346786-1346517 


Minus 


1570235-1570142 


Minus 


51414624141329 


Minus 


36586704658545 


Minus 


36720564671922 


Minus 


4008389-4008037 


Minus 


40501344049973 


Minus 


4060797-4060707 


Minus 


4087301-4087157 


Minus 


41350974135025 


Minus 


45232034523090 


Minus 


45381894538075 


Minus 


50539644053829 


Minus 


58665044866154 


Minus 


61498434149786 


Minus 


7153401-7153085 


Minus 


7282160-7282020 


Minus 


8728655-8728580 


Minus 


9362067-9381878 


Minus 


1 0481242-1 0481089 


Minus 


10989617-10989530 


Nfinus 


11237977-11237876 


Nfinus 


11519629-11519501 


Minus 


14183649-14183568 


Minus 


14287001-14286830 


Minus 


14484254-14484179 


Minus 


16048151-16047985 


Minus 


16275030-16274895 
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OJOOUD 


Him ham 1 of ill MiniiQ 
uuniiain, 1. UUai. minus 


103 00 J IJ7™ IQ9Wu.Hf 


338325 


nnnham 1 ptal Mimic 
UUJUlaJII, 1. oLai. ivuilua 


17241596-17241507 


3383® 


Hi m Ham 1 otfll Minna 
UUrUIdJII, 1. OUIU IVUJIU9 


1 0£l tJJQi lot/ CvCJ 


00000 f 


UUnntuii, (• vLal. IY1UIU3 


1 8534404.18534234 


338441 
00040 1 


uunnarn, 1. buu. iyiiuuo 


19747B0A.1974740B 


3384.cn 

00 043U 


nimham 1 At ill M!nn<t 

uunnain, i. clbi. Minus 


2fl1 723fi4-201722nn 


44Q4C1 


uunnain, 1. euai. Minus 




14A4fifi 

000400 


uunnain, u euai. minus 


910R31 *i9-9in(ttn4Q 
£ 1 UDOl Ut~t IU00U49 


OOOOUO 


Dunham, I. etal. Minus 


210U01 01 '£\ ou/ yzo 


Jooo/o 
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Plus 


143648-144108 


328276 


6004471 


Plus 


13282-13450 


328277 


6004471 


Minus 


279901-280181 


328662 


6004473 


Plus 


1184773-1184855 


328636 


6004473 


Plus 


192484-192543 


328803 


6004475 


Minus 


291716-291948 


328305 


6004478 


Minus 


3473044851 


328569 


6004480 


Plus 


232896-233243 


328581 


6006033 


Minus 


121249-121400 


328582 


6006033 


Minus 


134177-134282 


326768 


6017031 


Minus 


223741-224238 


328770 


6017031 


Minus 


363933464166 


328841 


6381920 


Minus 


52144479 


328851 


6381923 


Plus 


2502-2606 


328859 


6381928 


Plus 


6904549138 


328860 


6381928 


Plus 


8326543366 


328863 


6381929 


Minus 


29313-29506 


328868 


6381930 


Plus 


112825-112993 


328876 


6525286 


Plus 


94053-941B5 


328886 


6588003 


Pius 


3106841429 


328888 


6588003 


Minus 


111901-111999 


328936 


5868500 


Minus 


1352202-1352259 


328938 


5868500 


Plus 


1522923-1522986 


328971 


6478806 


Minus 


23976-24105 


330338 


5457162 


Pius 


4840648516 


330327 


5919194 


Plus 


121551-121583 


330319 


5932415 


Plus 


4909540132 


328974 


5666520 


Plus ' 


3155741668 


328981 


5868527 


Minus 


105677-105764 


328989 


5868535 


Plus 


182088-182198 


330363 


3126882 


Mnus 


6183841901 


330370 


6580495 


Pius 


10826-11669 


329041 


5868564 


Plus 


141592-141785 


329076 


5866597 


Plus 


326796426860 


329097 


5868624 


Plus 


12002-12170 


329107 


5868626 


Plus 


101063-101190 


329114 


5868650 


Minus 


23792-23910 
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329116 


5868650 


329164 


5868691 


329187 


5868713 


329201 


5868716 


329221 


5868727 




coco-ron 
00 DO/ 0£ 


329254 


5868733 


329326 


5868806 


329330 


5868806 


329382 


5868868 


329384 


5868869 


329386 


6004484 


329140 


6017060 


329162 


6056331 


329018 


6249620 


329319 


. 6381976 


329392 


6476815 


329029 
329401 


6525302 
6682544 


329406 


6682547 


329411 


6682549 


329429 


5668882 


329436 


5868883 


329464 


6456788 



Minus 4338943493 

Plus 62305-62517 

Plus 2990940175 

Pius 79266-79539 

mm 105837-105894 

Minus 250541-250792 

Pius 41334214 

Pius 155884-155992 

Minus 340278440403 

Ptus 4140141655 

Minus 116524-116662 

Pius 160502-161110 

Plus 290842-290905 

Minus 662206-663423 

Plus 103950-104034 

Plus 721390-721470 

Plus 109766-109854 

Ptus 281445-282490 

Ptus 2134244014 

PIUS 4724947395 

Minus 8455844835 

Minus 97008-97091 

Plus 230265-230526 

fvfinus 44374538 



219 



WO 02/068677 



PCT/US02/06001 



TABLE 21: 

310 GENES UP-REGULATED IN COLON CANCER DERIVED LIVER 
METASTASES COMPARED TO NORMAL COLON TISSUE 

Table 21 shows 310 genes up-regulated in colon cancer derived liver metastases compared to 
normal colon tissue. These were selected from 59680 probesets on the Affymetrix/Eos Hu03 
GeneChip array such that the ratio of "average" colon cancer derived liver metastases to 
"average" normal colon tissues was greater than or equal to 3.0. The "average" colon cancer 
derived liver metastases level was set to the 50th percentile. The "average" normal colon 
tissue level was set to the 50th percentile. 



Pkey: Unique Eos probeset identifier number 
ELxAccn* Excinplsr Accession omribcr, O en tannic Accession pneofoer 



UnigenelD: Unigene number 
Unigene Title: Unigene gene title 

Rl: Genes up mets vs normal 



Pkey ExAccn Unigene© Unigene Title Rl 

446619 AU076643 Hs.313 secreted phosphoprotetn 1 (osteopontin, 26.72 

431958 X63629 Hs.2877 cadherin 3, type 1, P-cadherin (placenta 16.36 

409041 AB033025 Hs.50081 KIAA1 199 protein 1194 

444381 BE38733S H&283713 ESTs, Weakly similar to S64054 hypotheti 13.90 

432314 AA533447 Hs.312989 ESTs 1224 

428330 L22524 Hs.2256 matrix rnetaltoprotelnase 7 (matrifysin, 11.60 

443162 T49951 Hs.9029 DKFZP434G032 protstn 952 

436385 BE551618 Hs.144097 ESTs 9.20 

418662 AI801098 Hs.151500 ESTs 9.00 

433312 AI241331 Hs,131765 ESTs, Moderately similar to I38937 DNA/R 8.90 

412093 BE242691 Hs.14947 ESTs 8.74 

442369 AI565071 Hs.159983 ESTs 6.40 

426101 AL049987 Hs.166361 Homo saptens mRNA; cDNA DKFZp564F1 12 (fr 8.39 

435937 AA830893 Hs.119769 ESTs 8.22 

452281 T93500 Hs.28792 Homo sapiens cONA RJ1 1041 fis, clone PL 8.22 

432572 AI660840 Hs.191202 ESTs, WeaWy stmllar to ALUE_HUMAN III! 7.96 

440524 R71264 Hs.16798 ESTs 7.94 

424878 H57111 Hs^21132 ESTs 7.88 

430433 AA478883 Hs.273766 ESTs 7*2 

410245 C17908 Hs.194125 ESTs 7.78 

417315 AI080042 H&336901 ribosomal protein S24 7.76 

430665 BE350122 Hs.157367 ESTs, Weakly similar to I78885 serine/th 7.76 

432435 BE216886 H&282070 ESTs 7.74 

426818 AA554827 Hs.289115 DKFZp434A0131 protein 7.58 

419145 N99638 gbza39g11 .rl Soares fetal liver spleen 7.56 

444838 AV651680 HS208558 ESTs 7.54 

428046 AW812795 Hs.155381 ESTs, Moderately similar to 138022 hypot 7.48 

446682 AW205632 Hs^11198 ESTs 7.26 

421221 AW276914 Ha.326714 Homo sapiens clone IMAGE:713177, mRNA se 7.19 

440116 AI798851 Hs.283108 hemoglobin, gamma G 7.12 

450230 AW016607 Hs.201582 ESTs 7.08 
456332 AA228357 gfcnc39dQ5.r1 NCI_CGAPJ>r2Homo sapiens 7.04 

421814 L12350 Hs.108623 tnrombospondin 2 6.89 

440774 AI420611 Hs.127832 ESTs 6.86 

428065 A1634046 Hs.157313 ESTs 6.76 

422330 D30763 Hs.1 15263 epiregulin 6.72 

413950 AA249096 Hs.32793 ESTs 6.67 

438011 BE466173 Hs.145696 spRong factor (CC1. 3) 6.62 

421057 T58283 Hs.10450 Homo sapiens cONA: FU22063 fis, done H 6.58 

428698 AA852773 Hs.334638 WAA1866 protein 6.40 

408806 AW847614 Hs.289005 Homo sapiens cONA: FU21532 fis, clone C 6.38 

425787 AA363867 Hs.155029 ESTs 6.38 

435812 AA700439 Hs.188490 ESTs 6.32 

448974 AL049390 Hs.22689 Homo sapiens mRNA; cDNA DKFZp5B601318 (f 6.28 

418875 W19971 Hs233459 ESTs 6.22 

407284 AI539227 Hs214039 hypothetical protein FU23556 6.17 

408243 Y00787 Hs.624 fnterieukin 8 6.12 

434936 AJ285970 Hs.163817 ESTs 6.12 
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Hs.108932 ESTs 6.04 
450377 AB033091 Hs.74313 KIAA1265 protein 6.00 
407616 AW054922 Hs.53478 Homo sapiens cDNA FU12366 fis, clone MA 5.98 
408296 AL117452 Hs.44155 DKFZP586G1517 protein 554 
456999 AA319798 Hs5985B1 eijkaryofic translation elongation factor 5.90 
432559 AW452948 Hs.257631 ESTs 5.88 
423349 AF010258 Hs.127428 homeoboxA9 5.84 
436100 AA704806 Hs.143842 ESTs, Weakly similar© 2004399A chromes 5.84 
453204 R10799 Hs.191990 ESTs 5.84 
429183 AB014604 Hs.197955 KIAA0704 protein 5.78 
427882 AA640987 Hs.193767 ESTs 5.72 
447033 AI357412 Hs.157601 ESTs 5.70 
428054 A046668 Hs.266619 ESTs 5.66 
414504 AW069181 Hs.1 15175 sterile^lpha motif and leucine zipper c 5.64 
442806 AW294522 Hs.149991 ESTs 5.64 
418259 AA215404 Hs.137289 ESTs 5.60 
434983 AW974957 Hs.288719 Homo sapiens cDNA FU12142 fis, done MA 5.60 
419999 AI760942 Hs.191754 ESTs 5.58 
431749 AL049263 Hs306292 Homo sapiens mRNA; cDNA DKFZp564F133 (fr 5.58 
422790 AA809875 Hs^5933 ESTs 5.56 
440980 AL042005 Hs.1 117 trtpeptidyt peptidase II 5.48 
432451 AW972771 Hs.292471 ESTs, Weakly similar to ALU1.HUMAN ALU S 5.46 
438578 AA811244 Hs.164168 ESTs 5.44 
410467 AF102546 Hs.63931 dachshund (DrosophBa) homotog 542 
426317 AA312350 Hs.169294 transcription factor 7 (T-ceO specific 5.42 
450164 AJ239923 Hs.30098 ESTs 5.40 
438899 AF085833 Hs.135624 ESTs 538 
432945 AL043683 Hs.8173 hypothetical protein FU1 0803 5.36 
437176 AW176909 Hs.42346 caldnearirvbinding protein ca!sarcin-1 534 
419829 AI924228 Hs.115185 ESTs, Moderately simBar to PC4259 ferri 533 
407966 AA295052 Hs38516 Homo sapiens, clone MGC: 15887, mRNA, com 530 
447342 AI199268 Hs.1 9322 Homo sapiens, Similar to RIKENcONA 2010 5.26 
419682 H13139 Hs.92282 paired-like homeodomain transcription fa 5.26 
421097 AI280112 Hs.125232 Homo sapiens cONA FU13266 fis, clone OV 5.22 
443373 AI792868 Hs.1 35365 ESTs 5.22 
412059 AA317962 Hs^49721 ESTs, Moderately similar to PC4259 ferri 521 
443651 W22152 Hs.282929 ESTs 5.21 
411274 NM.002776HS.69423 kaffikretolO 5.17 
421999 U50535 Hs.1 10630 Human BRCA2 region, mRNA sequence CG006 5.17 
426981 AL044675 Hs.173081 KIAA0530 protein 5.14 
431319 AA873350 Hs302232 ESTs 5.10 
434966 AA657494 gb:nt66f04.s1 NCLCGAP_Pr3 Homo sapiens 5.10 

418830 BE513731 Hs.88959 hypothetical protein MGC481 6 5.08 
428290 A1932995 Hs.183475 Homo sapiens clone 25061 mRNA sequence 5.07 
408784 AW971350 Hs.63386 ESTs 5.04 
411975 AI916058 Hs.144583 ESTs 5.02 
409760 AA302840 gbf ST1 0534 Adipose tissue, white I Homo 4.97 

420717 AA284447 Hs.271887 ESTs 4.96 
417035 AA192455 Hs.22968 Homo sapiens clone IMAGE:451 939, mRNA se 4.95 
434442 AA737415 Hs.152826 ESTs 4.94 
441328 A1982794 Hs, 159473 ESTs 4.92 
438962 BE046594 gb:hn41c11j(1 NCLCGAPJU3F2 Homo sapiens 4.92 

451277 AK001123 Hs.26176 hypothetical protein RJ 10261 4.92 
438406 BE273296 Hs.254467 Homo sapiens cDNA FU13255 fe, done OV 430 
424950 AA602917 Hs.156974 ESTs 4.86 
436823 AW749865 Hs.293645 ESTs, Weakly similar to I38022 hypotfteti 4.87 
444763 AKD01468 Hs.62180 anfliin (DrosophOa Scraps homolog), act 4.82 
444301 AK000136 Hs.1 0760 asporin (LRR class 1) 4.80 
445390 AI222165 Hs.144923 ESTs 430 
439608 AW864696 Hs301732 hypothetical protein MGC5306 4.78 
450506 NM.004460Hs.418 fibroblast activation protein, alpha 4.78 
432682 AI376400 Hs.159588 ESTs 4.76 
426086 T94907 Hs.188572 ESTs 4.76 
435961 H74319 Hs.188620 ESTs 4.74 
432340 AA534222 gb:nj21d02^1 NCLCGAPJW Homo sapiens 4.72 

435756 AI418466 Hs.33665 ESTs 4.72 
447982 H22953 Hs.137551 ESTs 4.72 
449509 AA001615 Hs.84561 ESTs 4.72 
407946 AA226495 Hs.154292 ESTs 4.70 
426215 AW963419 Hs.155223 stanniocaicin 2 4.70 
414783 AW069569 Hs.278270 (inactive progesterone receptor, 23 kD 4.68 
417601 NM_014735Hs32292 KIAA0215 gene product . 4.68 
438461 AW075465 Hs.286049 phosphoserine aminotransferase 4.68 
449032 AA045573 Hsi2900 nuclear factor (erythrofc^terived 2>-iIk 4.68 
426501 AW043782 Hs.293616 ESTs 4.67 
409024 AW883529 Hs.173830 ESTs, Weakly similar to AUOUMAN ALUS 4.67 
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439846 AW979249 gbf ST391359 MAGE resequences, MAGP Homo 4.66 

424762 AL119442 Hs.183684 eukaryottc translation Initiation factor 4.66 
442007 AA301116 Hs.142838 nucleolar phosphoprotein Nopp34 4.62 
409632 W74001 Hs.55279 serine (or cysteine) proteinase inhfoto 4.62 
432409 AA806538 Hs.130732 KIAA1575 protein 4.60 
452220 BE158006 Hs.212296 ESTs 4.60 
442577 AA292998 Hs. 163900 ESTs 4.58 
434001 AW950905 Hs.3697 serine (or cysteine) proteinase Inhibito 4.58 
414271 AKD00275 Hs.75871 protein kinase C binding protein 1 4.58 
433854 AA610649 Hs.333239 ESTs 4.56 
431315 AW972227 Hs.163986 Homo sapiens cDNA: FU22765 fte, done K 453 
434220 Al 174777 Hs.283039 Homo sapiens PR02492 mRNA, complete cds 4i0 
457752 AIB21270 Hs.285543 Homo sapiens cDNA FU14364 fis, done HE 4.50 
449941 AW450536 Hs.209260 ESTs 4.48 
415116 AA160363 Hs.269956 ESTs 4.47 
414386 X00442 Hs.75990 haptoglobin 4.47 
422956 BE545072 Hs.122579 hypothetical protein FU10461 4.44 
423974 AL118754 gb:DKFZp761P1910_r1 761 (synonym: hamy2) 4.44 

44961B A1076459 H&15978 WAA1272 protein 4.44 
428279 AA425310 Hs.155766 ESTs, WeaWy similar to A47582 B-ceD gr 4.42 
430573 AA744550 Hs.136345 ESTs 4.42 
430929 AA489166 Hs.156933 ESTs 4.40 
433530 BE349534 H&281789 ESTs 4.40 
446099 T93096 Hs.17126 hypotheticai protein MGC15912 4.40 
447082 T85314 Hs.42644 thioredoxin-like 439 
407168 R45175 Hs.117183 ESTs 4.38 
417067 AJ001417 Hs.81086 solute carrier family 22 (extrarenal 4.38 
408380 AF123050 Hs.44532 dlubiquitin 4.36 
431379 AA504264 Hs. 182937 peptjdylprolyl isomerase A (cyctophilin 4.36 
406671 AA129547 Hs.285754 met proto-cncogene (hepatocyte growth fa 434 
419317 AA236282 Hs.172316 ESTs 432 
450295 AI766732 Hs.210628 ESTs 432 
423578 AW960454 Hs-222830 ESTs 431 
419553 N34145 Hs.250614 ESTs, Moderately simBar to ZN91.HUMAN Z 431 
429512 AA453987 Hs.144802 ESTs 4.30 
426848 H72531 Hs.36190 ESTs 4.30 
429831 AA564489 Hs.137526 ESTs 4.30 
433735 AA608955 Hs.109653 ESTs 4.30 
450546 AA010200 Hs.175551 ESTs 4.27 
421059 AI654133 Hs.30212 thyroid receptor interacting protein 15 4.27 
413243 AA769266 Hs.193657 ESTs 4.26 
433230 AW136134 Hs.220277 ESTs 4.22 
439717 W94472 Hs.59529 ESTs. Moderately similar to AW1_HUMAN A 4.20 
439362 AI954880 Hs.134604 ESTs 4.19 
450157 AW961576 Hs.60178 ESTs 4.17 
451690 AW451469 H&209990 ESTs 4.17 
41B661 NW.001949Hs.1189 E2F transcription factor 3 4.16 
443135 AI376331 Hs.156103 ESTs 4.16 
443148 AI034357 Hs.211194 ESTs, Weakly similar to ALU8_HUMAN ALU S 4.16 
407765 AW076027 Hs.257711 ESTs, Moderately similar to ALU8_HUMAN A 4.14 
428825 AI084336 Hs.128783 ESTs, WeaWy similar to 138022 hypotheti 4.14 
447519 U46258 Hs339665 ESTs 4.14 
439451 AF086270 Hs.278554 heterochromatin-iSte protein 1 4.12 
450219 A1826999 Hs.224624 ESTs 4.12 
431451 AA761378 Hs.192013 ESTs 4.11 
432917 NMJ>14125Hs779812 PRO0327 protein 4.10 
431328 AA5Q2999 H&291591 ESTs 4.09 
425992 AA367069 Hs.100636 ESTs 4.08 
404571 4.06 
420911 U77413 Hs.100293 CWinked N-acetylglucosamira (GlcNAc) tr 4.06 
421114 AW975051 Hs293156 ESTs, Weakly similar to 178885 serine/th 4.06 
432731 R31 178 Hs.287820 fibronectin 1 4.06 
433588 AI056872 Hs.133386 ESTs 4.06 
434658 AI624436 Hs310286 ESTs 4.06 
444040 AP204231 Hs.182982 golgin-67 4.06 
444984 H15474 Hs.132898 fatty add desaturase 1 4.06 
438543 AA810141 Hs.192182 ESTs 4.05 
413497 BE177661 gb«C1-HT0598-0203(«W)11+02HT0598Horno 4.04 

434575 AI133446 Hs.299954 Homo sapiens done FLB7723 PRO2055 mRNA, 4.04 
430256 AA470152 Hs.192195 ESTs 4.04 
424839 AA740632 Hs. 120850 ESTs, WeaWy similar to ALU1J4UMAN ALU S AM 
429048 AI372949 Hs.44241 Homo sapiens cDNA:FU21447fis, done C 4.02 
449429 AA054224 Hs.59847 ESTs 4.02 
410762 AF226053 Hs.66170 HSKM-B protein 4.00 
418876 AA740616 gteny97f11.s1 NCLCGAP.GCB1 Homo sapiens 4.00 

425905 AB032959 Hs318584 novel C3HC4 type Zinc ringer (ring finge 4.00 
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429500 X78565 Hs589114 hexabrachion {tenasdn C, cytotacfln) 4.00 

431393 AW971493 Hs.134269 ESTs, Highly similar to cytokine recepto 4.00 

435008 AF150262 Hs.162898 ESTs 4.00 

431361 AW971375 Hs592921 ESTs 3.97 

444816 Z48633 Hs.283742 H.sapiens mRNA for retrotransposon 3.96 

434701 AA460479 Hs321707 KIAA0742 protein 3.95 

413886 AW958264 H&103832 similar to yeast UpB, variant B 3.95 

424905 NM.002497HS.153704 NIMA (never in mitosis gene a)-retated k 3.92 

428479 Y00272 Hs.184572 ceil division cycle 2, G1 to S and G2 to 3.91 

435714 AA6S9325 Hs.269880 ESTs 3.86 

447514 AJ609314 Hs508501 ESTs, Weakly similar to 634087 hypothec 3.86 

453818 BE256832 Hs.10711 hypothetical protein RJ13449 3.85 

433566 T85301 gtoyd78d06.s1 Scares fetal Over spleen 3.85 

440638 A1376551 gb.1e64e10jc1 Scares NFLT.GBC.S1 Homos 3.85 

417819 AI253112 Hs.133540 ESTs 3.84 

409536 BE244200 Hs.55075 KIAA0410 gene product 3.83 

423129 L44396 Hs.124106 Homo sapiens cDNA FIJ1 1941 fts t ctoneHE 3.83 

453884 AA355925 Hs36232 WAA0186 gene product 333 

431193 AW749505 Hs596770 K1AA1719 protein 3.81 

409262 AK000631 Hs32256 hypothetical protean FU20624 330 

425568 AW963118 Hs.161784 ESTs 3.78 

441085 AW136551 Hs.181245 Homo sapiens cDNA FU12532 fis, clone NT 3.77 

428079 AA421020 Hs508919 ESTs 3.77 

412490 AW803564 Hs588850 Homo sapiens cDNA: FU22528 fis, clone H 3.76 

435354 AA678267 Hs.117115 ESTs 3.75 

436535 AW295687 Hs.254420 ESTs 3.74 

420439 AW270041 Hs.1 93053 eukaryotic translation initiatiori factor 3.72 

435090 AI640635 Hs.1 16468 EST 3.71 

416265 AA177088 Hs.190065 ESTs 3.70 

417715 AW969587 Hs.86366 ESTs 3.67 

435677 AA694142 Hs593726 ESTs, WeaWy similar to TSGA RAT TESTJS 3.67 

438607 AW080237 Hs552884 ESTs 3.66 

408194 AA601038 Hs.191797 ESTs, Weakly simitar to S65657 alpha~1C- 3.65 

417211 T97617 Hs.269092 ESTs 3.60 

435538 AB011540 Hs.4930 low density lipoprotein receptor-rented 339 

410390 AAS76905 Hs.125286 ESTs 338 

438818 AW979008 Hs.222487 ESTs 3.57 

431416 AA532718 HS.17B604 ESTs 3.57 

433517 AW022133 Hs.189838 ESTs 336 

428355 BE256452 Hs2257 vitronectin (serum spreading factor, som 336 

432954 AI076345 Hs.214199 ESTs 333 

434466 AB037829 H&3862 regulator of nonsense transcripts 2; DKF 3.53 

421933 R98881 HS.1P9655 sex comb on midteg prosophteHke 1 332 

422082 AA016188 Hs.111244 hypothetical proteki 332 



437135 AL038624 Hs.208752 ESTs, Weakly similar to ALU8 JIUMAN ALU S 3.49 

424723 BE409813 Hs, 152337 protect argfnine N^nethyitransferase 3(h 3.49 

434280 BE005398 gb£M1-BN0116-150400-18WT02BN0116Homo 3.49 

407289 AA135159 Hs503349 Homo sapiens cDNA FU12149 fis, clone MA 3.48 



417670 R07785 gb:yf15cG6.r1 Soares fetal iiver spleen 3.48 

431615 AW295859 Hs535860 ESTs 348 

429355 AW973253 Hs292689 ESTs 3.45 

430068 AA464964 gbzx80f10.s1 Soares ovary tumor NbHOT H 3.45 

432929 AW207166 Hs.191265 ESTs 3.44 

437763 AA469369 Hs.5831 tissue inhfotor of mstaJloproteinase 1 3.44 

445674 BE410347 Hs.13063 transcription factor CA150 3.42 

408113 TB2427 Hs.134101 Homo sapiens cDNA: FL/20869 fis, done A 3.42 

408908 BE296227 Hs550822 sermarthreonine kinase 15 3.41 

432235 AA531129 Hs.190297 ESTs 3.41 

453985 N44545 Hs.251865 ESTs 3.41 

415736 AA827082 Hs591872 ESTs 336 

430220 BE378277 Hs.152230 ESTs 3.37 

426510 AW861225 Hs.194637 BANP homolog, SMAR1 homotog 337 

412104 AW205197 Hs.240951 Homo sapiens, Similar to RIKEN cDNA 2210 3.36 

411573 AB029000 Hs.70823 KIAA1 077 protein 333 

413816 AW958181 Hs.189998 ESTs 3.32 

426057 AI343641 Hs.185798 ESTs 3.32 

436280 AI690734 Hs.1 31 740 Homo sapiens cDNA:RJ22562 fis, cioneH 331 

449365 AW968261 Hs.1 18913 ESTs, Moderately similar to T46371 hypot 3.31 

440659 AF134160 Hs.7327 daudln 1 3.30 

436110 AA704899 Hs591651 ESTs, Weakly similar to 138022 hypotheti 359 

433862 086960 Ks.3610 WAAQ2Q5 gene product 3.29 

424624 AB032947 Hs.151301 Ca2*depCTdent activator protein for seer 359 

439955 AW203959 Hs.1 49532 ESTs 358 

417333 AU57545 Hs .42179 bromodornaln and PHD finger containing, 3 3.28 

436150 AW510927 Hs.125243 ESTs 357 

414900 AW452420 HS548678 ESTs 356 
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439349 A1660898 Hs.195602 ESTs 325 

428255 AI527478 Hs.187670 ESTs 324 

438217 T53925 Hs/107 fibrinogen-like 1 3.24 

429083 Y09397 Hs227817 BCU -related protein A1 324 

422244 Y0889O Hs.113503 karyopherin (importin) beta 3 322 

430178 AW449612 Hs.152475 ESTs 321 

413810 AW197644 Hs.19107 ESTs 320 

428728 NM.016625HS.191381 hypothetical protein 320 

437151 AA745618 Hs.194637 BANP homobg, SMAR1 homotog 3.19 
427051 BE178110 Hs.173374 Homo sapiens cDNA RJ10500 ffs. done NT 3.19 

438378 AW970529 Hs.86434 hypothetical protein FU2181 6 3.19 

439943 AW083789 Hs.124620 ESTs 3.18 

439280 AI125436 Ks.48752 ESTs 3.18 

452336 AA960961 Hs.305953 zinc finger protein 83 (HPF1) 3.17 

433713 AW976511 Hs.112592 ESTs 3.16 

414998 NM-002543HS.77729 oxidised low density lipoprotein (lectin 3.14 
407326 AA508857 Hs.187748 ESTs, Weakly similar to ALULHUMAN ALU S 3.14 

432722 AA830532 Hs.326150 ESTs 3.14 



419457 AA243146 Hs.209334 ESTs, Moderately similar to S23A^UMAN P 3.11 

449987 AW079749 Hs.184719 ESTs, WeaWy similar to ALU 1JHUMAN ALU S 3.11 

418522 AA605038 Hs.7149 Homo sapiens cDNA: FU21950 fis, done H 3.09 

409969 AW514668 Hs. 194258 EST s, Moderately similar to ALU5.HUMAN A 3.08 

436299 AKD00767 Hs.5111 hypothetical protein FU20729 3.08 

406687 M31126 Hs272620 pregnancy specific beta-1 -glycoprotein 9 3.07 

408242 AA251594 Hs.43913 PIBF1 gene product 3.07 

444614 R44284 Hs.2730 heterogeneous nuclear riborrucieoproteln 3.06 

459407 N92114 gb:za22h11.r1 Soares fetal fiver spleen 3.05 

433972 AI878910 HsJ3688 dsplatln re^'stance-assocteted overexpr 3.04 

427704 AW971063 Hs292882 ESTs 3.03 

440255 AI932285 Hs.160569 ESTs 3.03 

424542 AI860558 Hs272009 ESTs, Weakiy similar to ALU2JHUMAN ALU S 3.03 

413822 R08950 Ha272044 ESTs, WeaWy similar to ALU1.HUMAN ALU S 3.02 

433944 AL117518 H&3686 KIAA0978 protein 3.01 

440428 BE560954 gbfi 01 34771 9F1 NIH.MGCJ Homo sapiens cD 3.00 
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TABLE 21 A 



Table 21A shows the accession numbers for those pkeys lacking unigenelD's for Table 21 A. 
For each probeset we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

Pkey; Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 

Pkey CAT Number Accession 

409760 115373J AA302840 T93016 T92950 AA077551 

413497 1373771J BE177661 H06215 BE144709 BE144829 

417670 1692163.1 R07785T85948T86972 

418876 179960.1 AA740616AA654854AA229923 

419145 16221 7_1 N99538 AW973750 AA328271 H50994 AA558020 AA234435 N59599 R948 15 

423974 233842.1 AL118754 AA333202H38001 

430068 312849 J AA4649B4M85405AA947566 

432340 345248 J AA534222AA632632T81234 

433586 370470 1 T85301 AW517087 AA601054 BE073959 

434280 382816 1 BE005398 AA628622 AA994155 

434966 396504 J AA657494 AI582663 AI531639 

438962 467390.1 BE046594 BE046667 AAB28585 A1207343 

439848 477806 J AW979249 D63277 AA846968 

440428 49370.-1 BE560954 

440638 499025.1 AI376551 T87714AA897445 

456332 179104.1 AA22B357AW841786AWB41716 
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TABLE 21B 

Pkey: Unique number corresponding to an Eos probsset 

Ref; Sequence source. Trie 7 digit numbers in this column are Genbank Identifier (Gl) numbers. 'Dunham (. et a!.' refers to the publication 

entitled The DNA sequence of human chromosome 22.' Dunham L et aL, Nature (1 999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 
NLposition: Indicates nucleotide positions of predicted exons. 

Pkey Ref Strand NtposHion 
404571 7249169 Minus 112450-112648 
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TABLE 22: 177 GENES DOWN-REGULATED IN COLON CANCER 
DERIVED LIVER METASTASES COMPARED TO NORMAL COLON 
TISSUE 

Table 22 shows 177 genes down-regulated in colon cancer derived liver metastases 
compared to normal colon tissue. These were selected from 59680 probesets on the 
Affjroetrix/Eos Hu03 GeneChip array such that the ratio of "average" colon cancer derived 
liver metastases to "average" normal colon tissues was less than or equal to 0.25. The 
"average" colon cancer derived liver metastases level was set to the 50th percentile. The 
"average" normal adult tissue level was set to the 50th percentile. 

Pkey. Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnfcenelD: Undone number 

UnkjeneTtfle: Unlgene gene title 

R1: Genes down mets vs. normal 



Pkey ExAccn Untgene© Unlgene Title R1 

425196 AL037915 Hs.1 55097 carbonic anhydrase II 0.03 

414522 AW518944 Hs.76325 step II spficing factor SLU7 0.03 

409153 W03754 H&50813 hypothetical protein RJ20022 0.03 

452594 AU076405 Hs.29981 solute carrier famSy 26 (sulfate transp 0.03 

424326 NM.014479HS.145296 cfelntegrin protease 0.04 

414798 AI286323 Hs.97411 hypothetical protein MGC1 2335 0.04 

432150 AK000224 Hs372789 hypothetical protein FU2021 7 0.04 

425205 NM.002153HS.155109 hydroxysteroW (17-beta) dehydrogenase 2 0.05 

437145 AF007216 Hs.5462 solute carrier family 4, sodium bkarbon 0.05 
447513 AW955776 Hs.313500 ESTs, Moderately similar to ALU7_HUMAN A 0.05 

414807 AI738616 Hs.77348 hydroxyprostaglandln dehydrogenase 15-{N 0.06 

428934 AF039401 Hs. 194659 cWoride channel calcium activated, fam 0.06 

432251 AW972983 Hs.232165 polycythemia rubra vera 1; ceD surface 0.07 

431727 AW293464 Hs.162031 ESTs 0.07 

421515 Y1 1339 Hs.105352 GaiNAcaipha-2, fr^alyttransferase I, I 0.07 

414555 N98569 Hs.76422 phosphoDpase A2, group IIA (platelets, 0.08 

412047 AA934589 Hs.49696 ESTs 0.08 

412056 T28160 Hs.778 guanylate cyclase activator 1B (retina). 0.08 

422440 NM_004812Hs.116724 aWo-keto reductase family 1, member 610 0.08 

450684 AA872605 Hs.25333 interteuWn 1 receptor, type II 0.09 

418935 T28499 Hs.89485 carbonic anhydrase IV 0.09 

433658 L03678 Hs.156110 immunogbbuBn kappa constant 0.09 

422260 AA315993 Hs. 105484 regenerating gene type IV 0.09 

433336 AF017986 Hs.31386 secreted frizzled-related protein 2 0.09 

426784 U03749 Hs.172216 chromogranin A (parathyroid secretory pr 0.09 

441888 AI733306 Hs. 128071 hypothetical protein FU21 302 0.10 

440624 AF017987 Hs.7306 secreted frizzled-retated protein 1 0.10 

420929 AI694143 Hs.296251 programmed ceil death 4 0.10 

429970 AK000072 Hs^27059 chtortde channel calcium activated, tarn 0.10 

417233 W25005 Hs.24395 small inducible cytokine subfamily B (Cy 0.10 

414802 AI793107 Hs.27018 Ris 0.10 

424566 M16801 Hs.1790 nuclear receptor subfamily 3, group C, m 0.11 

421996 AW583807 Hs.1460 glucagon 0.11 

423371 AU076819 Hs.1650 sdute carrier family 26, member 3 0.11 

406741 AA058357 Hs.74466 caiciiwembiyonfeantigerHelatedcellad 0.11 

414176 BE140638 Hs.75794 endomellal differentiation, lysophospha 0.11 

408741 M73720 Hs.646 carboxypeptidase A3 (mast cefl) 0.11 

424527 AW138558 Hs.267158 ESTs, Weakly similar to I54374 gene NF2 0.12 

426682 AV660038 Hs.2056 UDP glycosyltransferase 1 family, polype 0.12 

453967 AW009077 Hs.232947 ESTs 0.12 

425920 AL049977 Hs.162209 daudinS 0.13 

408134 AK000184 Hs.42945 acid sphingomyelinase-Gke phosphodieste 0.13 

457407 AA505035 Hs.195651 ESTs 0.13 

446500 U78093 Hs.15154 sushkepeatontalning protein, X chrom 0.14 

422487 AJ010901 Hs.198267 mucin 4, tracheobronchial 0.14 

409196 NML001874HS.334873 carboxypeptidase M 0.14 

416426 AA180256 Hs.210473 Homo sapiens cDNAFU 14872 fis, done PL 0.14 

406636 L12054 gfcHomo sapiens (ctone WR4.12VL) antMh 0.14 
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457982 AW856093 Hs.183617 ESTs 0.14 

407744 AB020B29 Hs.38095 ATP-brnding cassette, sub-famfly A (ABC1 0.14 

430378 Z29572 Hs2556 tumor necrosis factor receptor superfaml 0.14 

424885 AI333771 Hs.82204 ESTs 0.14 

423555 AW95B201 Hs.178589 hepatocellular carcinoma antigen gene 52 0.14 



444237 AA336878 Hs.9842 Human DNA sequence from done RP4-7B8L20 0.14 
445848 AA774624 Hs.13377 Homo sapiens done 23649 and 23755 unkno 0.14 
451062 AL110125 Hs25910 Homo sapiens mRNA; cDNA DKFZp564C1416 (f 0.14 



436485 X59135 Hs.156110 irrmimoglobulin kappa constant 0.14 

423655 AA722425 Hs.182785 ESTs, Moderately stm9ar to 1207289A rev 0.15 

417332 AW972717 Hs.288462 hypothetical protein aJ21 511 0.15 

427506 AKD00134 Hs.179100 hypothetical protein FU20127 0.15 

430712 AW044647 Hs.196284 ESTs 0.15 

421666 ALG35250 Hs.1408 endoWn3 0.16 

425692 D90041 Hs.155956 r^acetyltransferase 1 (arylamine N-acety 0.16 

429412 NMJ)06235Hs2407 POU domain, class 2, associating {actor 0.16 

433745 AF075320 Hs28980 hypothetical protein FU14540 0.16 



450085 AW293791 Hs.60162 Homo sapiens cDNA: FU21528 fls, clone C 0.16 
417820 D87449 Hs.82635 UDP-glucuronlc actoVUDP-N-acetyigalacte 0.16 
406722 H27498 Hs.293441 Homo sapiens SNC73 protein (SNC73) mRNA, 0.16 



426488 XQ3350 Hs.4 alcohol dehydrogenase 1B (class I), beta 0.16 

436327 AA813075 Hs.120181 ESTs 0.16 

408873 AL046017 Hs.182278 calmodulin 2 (phosphorylase kinase, delt 0.16 

429524 AB033037 Hs205293 K1M1211 protein 0.16 

447023 AA356764 Hs.17109 integral membrane protein 2A 0.17 



424264 D80400 Hs239388 Human DNA sequence from done RP1-304B14 0.17 

410310 J02931 Hs.62192 coagulation factor III (thromboplastin, 0.17 

432563 NMJ)13261Hs.198468 peroxisome proliferative activated recep 0.17 

406897 M57417 gb:Homo sapiens mucin (mucin) mRNA, part 0.17 

451096 BE383234 Hs25925 Homo sapiens, done MGS15393, mRNA, com 0.17 



447726 AL137638 Hs.19368 matrilin2 0.17 

409549 AB029015 Hs.54886 phosphofipase C, epsflon 2 0.17 

433334 AI927208 Hs231958 matrix metatoproteinase 28 0.17 

425849 AJ000512 Hs.296323 serum/glucocorticoid regulated kinase 0.17 
407360 X13075 gbiHuman 2a12 mRNA for kappa-lmmunogtobu 0.17 

430627 U61 148 Hs247685 atonal homolog 1 (Drosophiia) 0.17 

418807 NM004944H&88646 deoxyribonudease l-Iike 3 0.18 

453399 Z70295 Hs.32966 guarrytate cyclase activator 2B (uroguany 0.18 

422994 AW891802 H&296276 ESTs 0.18 

432134 A1816782 Hs. 122583 hypothetical protein FU21934 0.18 

400417 X72475 " 0.18 

443506 H10861 Hs.192124 ESTs, Weaxty similar to I3B022 hypotheti 0.18 
428470 AC002301 Hs, 184507 Homo sapiens Chromosome 16 BAG done CIT 0.18 

451928 AI823801 H&30315 CTCL tumor an6gense57-1 0.18 

429576 BE242628 Hs209061 sudD (suppressor of btmD6, Aspergillus n 0.18 

422106 D84239 Hs.111732 Fc fragment of IgG binding protein 0.19 
430304 AL122071 Hs.238927 Homo sapiens mRNA; cDNA DKFZp434H1235 (f 0.19 

452852 AK001972 Hs.30822 hypothefica) protein FU1 1110 0.19 

421904 BE143533 Hs.109309 hypothetical protein FU20Q35 0.19 

417165 R80137 Hs.302738 Homo sapiens cDNA: FLJ21425 fis, clone C 0.19 

417771 AA804698 Hs.82547 retinoic add receptor responder (tazaro 0.19 

452802 AU076403 Hs323468 elecfrorv^ransfenlng-tevoprotein dehyd 0.19 

450680 AF131784 Hs.25318 Homo sapiens done 25194 mRNA sequence 0.19 

420061 AWQ24937 Hs29410 ESTs 0.19 

426828 NM.000020HS.172670 activin A receptor type Il-Uke 1 0.19 

408190 AB032963 Hs.43577 ATPase, Class I, type 8B, member 2 0.19 

437682 AA476652 Hs.94952 Homo sapiens cDNA: FU23371 fis, done H 0.19 

449110 H56112 gbyq95f07 n Soares fetal Ever spleen 0.19 

446727 AB011095 Hs.16032 KIAA0523 protein 0.19 

408395 BE072425 Hs.44579 hypothetical protein FU20199 020 

423541 AA296922 Hs.129778 gastrointestinal peptide 0.20 

410850 AW362867 Hs.302738 Homo sapiens cDl^:FU21425fc,doneC 020 

412420 ALD35668 Hs.73853 bone morohogenetic protein 2 020 

423942 AF209704 Hs.135723 grycoilpid transfer protein 0.20 

421832 NM.016098HS.108725 HSPC040 protein 020 

459046 AA910339 Hs26216 LOC50627 020 

421360 AA297012 Hs.103839 erythrocyte membrane protein band 4.14i 020 

438091 AW373062 Hs.83623 nuclear receptor subfamily 1, group I, m 020 

403047 0.20 

421712 AK000140 Hs.107139 hypothetical protein 020 

427333 AF057797 Hs. 176658 aquaporm8 020 

421964 X73079 Hs.288579 polymeric Immunoglobulin receptor 0.20 

438089 W05391 Hs.83623 nuclear receptor subfamily 1, group I, m 021 

445200 AA084460 Hs.12409 somatostatin 021 

404854 021 

426390 AA377299 Hs.90431 ESTs 0.21 
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403381 021 

449833 R82252 Hs.106106 protein kinase (cAMP-dependent, cataiyti 021 

457718 F18572 Hs2297B ESTs, Weakly similar to ALU4„HUMAN ALU S 0.21 

435730 AB020635 Hs.4984 K1AA0828 protein 021 

431518 AA743462 Hs.165337 ESTs 021 

412589 R28660 Hs.24305 ESTs 021 

432584 AA928829 Ks.47099 hypothetical protetn FU21212 021 

428088 AFQ38007 Hs.166196 ATPase, Class I, type 8B, member 1 021 

429143 AA333327 Hs.1 97335 plasma glutamate carboxypeptidase 021 

414429 R51494 Hs.71818 5STs 022 

439670 AF088076 Hs.59507 ESTs, Weakly simflar to AC004858 3 U1 sm 022 

408697 M21 388 Hs.123017 Human unproductive^ rearranged Ig mtKh 022 

406663 U24683 Hs.302063 Immunoglobulin heavy constant mu 022 

407611 AW190902 Hs.40098 cysteine knot superfamffy 1 , BMP antagon 022 

417880 BE241595 Hs.82848 selectin L (lymphocyte adhesion molecule 022 

430107 AA465293 Hs.105069 ESTs 0.22 

424273 W40460 Hs.144442 phospholipase A2, group X 022 

419559 Y07828 Hs.91096 ring finger protein 022 

413517 N76712 Hs.44829 ESTs, Weakly sim9ar to I38022 hypoihefi 022 

407243 AA058357 Hs.74466 caitinoembryonic antigen-related cell ad 0.22 

433906 AI167816 Hs.43355 ESTs 0.22 

446203 Z47553 Hs. 14286 flavin containing monooxygenase 5 022 

403740 022 

405701 0.22 

413554 AA319146 Hs.75426 secretogranin II (chromogranln C) 022 

419577 L36531 Hs.91298 integral, alpha 8 0.23 

451820 AW058357 Hs337353 ESTs 023 

424897 D83216 Hs.1 53684 frfezled-reiated protein 023 

422880 AF228704 Ha. 121 524 glutathione reductase 023 

430832 A1073913 Hs.100686 ESTs, Weakly similar to JE0350 Anterior 0.23 . 

430753 AI432401 Hs2659 fibrinogen-like 2 0.23 

409060 A1815867 Hs.50130 necdin (mouse) homotog 023 

412228 AW503785 Hs.73792 comp!8flieirtcompoJient(3oVEpst^ 024 

414171 AA360328 Hs.865 RAP1A, member of RAS oncogene family 024 

417916 NM_006416Hs.82921 sohrte canter family 35 (CMP-siaDc aci 024 

414589 AA149791 Hs.68864 ESTs, Weakly similar to phosphatidylseri 0.24 

427167 AI239607 Hs.99196 hypothetical protein MGC11324 024 

440630 BE561430 Hs.239388 Human DNA sequence from clone RP1 -304B14 024 

423044 AA320829 Hs.97266 protocadherin 18 024 

441931 BE564830 Hs.23744 hypothetical protein FU 12899 0.24 

443060 D78874 Hs.8944 procoBagen C-endopeptidase enhancer 2 024 

405441 024 

407241 M34516 gbiHuman omega Ifcht chain protein 14.1 024 

415165 AW887604 Hs78065 complement component 7 0.24 

426447 AV655843 Hs.169919 etectron^ransfer^lavoprotein, alpha po 024 

410748 BE383816 Hs.12532 chromosome 1 open reading frame 21 024 

436032 AA150797 Hs.1 09276 latexln protein 024 

414256 AW410035 Hs.75862 MAD (mothers against decapentaptegic, Dr 024 

414197 W44877 Hs.55501 ESTs 024 

406836 AW514501 Hs.156110 immunoglobulin kappa constant 024 

437083 AW082597 Hs.244862 ESTs 025 

421709 AA159394 Hs.107056 CEM protein 025 

426512 AW511656 Hs.170177 Metsl (mouse) homolog 0.25 
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TABLE 22A 

Table 22A shows the accession numbers for those pkeys lacking unigenelD's for Tables 
21 A. For each probeset we have listed the gene cluster number from which the 
oligonucleotides were designed Gene clusters were compiled using sequences derived from 
Genbank ESTs and mRNAs. These sequences were clustered based on sequence similarity 
using Clustering and Alignment Tools (DoubleTwist, Oakland California). The Genbank 
accession numbers for sequences comprising each cluster are listed in the "Accession" 
column. 

Pkeyt Unique Eos probeset Identifier number 

CAT number. Gene cluster number 
Accession: Genbank accession numbers 

Pkey CAT Number Accession 

449110 798430J H561 12 H58047 AI630710 N58742 
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TABLE 22B 



Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbeis In this column are Genbank identifier (Gl) numbers. "Dunham 1. et ai" refers to the pubHcation 

entitled The DNA sequence of human chromosome 22.' Dunham I. et al., Nature (1999) 402:489495. 
Strand: indicates DNA strand from which exons were predicted. 

NLposltion: indicates nucleotide positions of predicted exons. 



Pkey Ref Strand NLposition 

403047 3540153 Minus 59793-59968 

403381 9438257 Minus 26009-26178 

403740 7630882 Plus 86504-87227 

404854 7143420 Plus 14260-14537 

405441 7408124 Plus 100952-101283 

405701 4263751 Phis 93243-93364 
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TABLE 23: 175 GENES UP-REGULATED IN COLON CANCER 
DERIVED LIVER METASTASES COMPARED TO COLON CANCER 
PRIMARY TUMOR SAMPLES CLASSIFIED AS DUKE'S B SURVIVOR 

Table 23 shows 175 genes up-regulated in colon cancer derived liver metastases compared to 
colon cancer primary tumor samples classified as Duke's B stage with a positive survival 
outcome (Duke's B survivor). These were selected from 59680 probesets on the 
Affymetrix/Eos Hu03 GeneChip array such that the ratio of "average" colon cancer derived 
liver metastases to "average" Duke's B survivor was greater than or equal to 3.0. The 
"average" colon cancer derived liver metastases level was set to the 50th percentile. The 
"average" Duke's B survivor level was set to the 50th percentile. 



Pkey: Unique Eos probeset Identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

R1 : Genes up liver metastases vs Duke's B survivors 



Pkey ExAccn UnigenelD Unigene Trae R1 

428101 AL049987 Hs.166361 Homo sapiens mRNA; cDNA DKFZp564F1 12 (ft- 9.08 

432572 A1660840 Hs. 1 91 202 ESTs, Weakly similar to ALUE_HUMAN MI 7.96 

424878 H57111 Hs.221132 ESTs 7.88 

428046 AW812795 Hs.155381 ESTs, Moderately similar to 138022 hypot 7.48 

407284 A1539227 Hs.214039 hypothetical protsin RJ23556 7.45 

439943 AW083789 Hs.124620 ESTs 7.00 

442369 AI565071 Hs.159983 ESTs 7.00 

415116 AA160363 Hs.269956 ESTs 6.98 

433517 AW022133 Hs.189838 ESTs 6.70 

437176 AW176S09 Hs.42346 calcinetirin^inding protein calsarcin-1 6.68 

440524 R71264 Hs.16798 ESTs 6.62 

408806 AW847814 Hs.289005 Homo sapiens cDNA: RJ21532 fe, clone C 638 
448974 AU049390 Hs.22689 Homo sapiens mRNA; cDNA DKF2p58601318 (f 628 

412088 AI689496 Hs, 108932 ESTs 6.04 

417670 R07785 gb:yf15c06.r1 Soares fetal Bver spleen 5.95 

440774 AI420611 Hs.127832 ESTs 5.91 

426086 T94907 Hs.188572 ESTs 5.90 

436100 AA704606 Hs.143842 ESTs, WeaJdy similar to 2004399A chromos 5.64 

453204 R10799 Hs.191990 ESTs 5.84 

407289 AA135159 Hs.203349 Homo sapiens cDNA FU12149 fls, done MA 5.67 

432435 BE218686 Hs.282070 ESTs 5.61 

434963 AW974957 Hs.288719 Homo sapiens cDNA RJ12142 fis, done MA 5.60 

421221 AW276914 Hs.326714 Homo sapiens clone IMAGB713177, mRNA se 5.54 

407328 AA508857 Hs.187748 ESTs, Weakly similar to ALU1_HUMAN ALU S 5.51 

440980 AL042005 Hs.1117 tripeptidyl peptidase II 5.48 

443651 W22152 Hs 282929 ESTs 5.42 

412668 AA455195 Hs.10056 hypothetical protein FU14621 529 

444838 AV651680 Hs208558 ESTs 5.24 

433312 AI241331 Hs.131765 ESTs, Moderately similar to 138937 DNA/R 5.11 

430665 BE350122 Hs.157367 ESTs, Weakly simitar to I78885 serine/th 5.11 

434966 AA657494 gb^tS6ID4^1 NCLCGAP.Pr3 Homo sapiens 5.10 

426897' AW976570 Hs.97387 ESTs 5.08 

432954 AI076345 Hs.214199 ESTs 5.07 

431416 AA532718 Hs.178604 ESTs 5.00 

420717 AA284447 Hs.271887 ESTs 4.96 

424950 AA602917 Hs.156974 ESTs 454 
438982 BE046594 gbJin41c1U1 NCLCGAP„RDF2Homosapiens4.92 

419999 AI760942 Hs.191754 ESTs 4.89 

435812 AA700439 Hs.188490 ESTs 4.86 

418662 AI801098 Hs.151500 ESTs 4.79 

428065 AI634046 Hs.157313 ESTs 4.77 

407618 AW054922 Hs.53478 Homo sapiens cDNA FU12366 fis, done MA 4.75 

435981 H74319 Hs.188620 ESTs 4.74 

419145 N99638 gbza39g11,r1 Soares fetal liver spleen 4.73 

432340 AA534222 gb:nj21d)2*1 NdCGAPJW Homo sapiens 4.72 

447982 H22953 Hs.137551 ESTs 4.72 
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449509 AA001615 Hs.84561 ESTs 4.72 

407946 AA226495 Ks.154292 ESTs 4.70 

438607 AW080237 Hs252884 ESTs 4.68 

438406 BE273296 H&254467 Homo sapiens cDNA RJ13255 fo, done OV 4.62 

426818 AA554827 Hs.289115 DKFZp434A0131 protein 4.62 

452220 BE158006 Hs212296 ESTs 4.60 

436823 AW749865 Hs293645 ESTs, Weaidy similar to 138022 hypotheli 4.60 

433854 AA610649 Hs.333239 ESTs 4.56 

413816 AW958181 Hs.189998 ESTs 4.52 

428079 AA421020 Hs208919 ESTs 4.52 

421097 AI280112 Hs.125232 Homo sapiens cDNA FU1 3266 fts, done OV 4.50 

417035 AA192455 Hs22968 Homo sapiens clone IMAGE:451939, mRNA se 4.48 

423974 AL118754 gfcDKFZp761P1910_r1 761 (synonym: hamy2) 4.44. 

449618 AI076459 Hs.15978 KIAA1272 protein 4.44 

431615 AW295859 H&235860 ESTs 4.44 
418876 AA740616 gb:ny97f11.s1 NCLCGAP.GCB1 Homo sapiens 4.43 

428279 AA425310 Hs.155766 ESTs, Weakly simflar to A47582 B^sfl gr 4.42 

430573 AA744550 Hs.136345 ESTs 4,42 

430929 AA489166 Hs.156933 ESTs 4.40 

446099 T93096 Hs.17126 hypothetical protein MGC15912 4.40 

439362 AI954880 Hs.134604 ESTs 436 
421999 U50535 Hs. 110630 Human BRCA2 region, mRNA sequence CG006 4.35 

434220 AJ 174777 H&283039 Homo sapiens PR02492 mRNA, complete cds 433 

432925 AA878324 Hs.192734 ESTs 432 

417819 AI253112 Hs.133540 ESTs 4.30 

426848 H72531 Hs36190 ESTs 430 

429831 AA564489 Hs.137526 ESTs 4.30 

433735 AA608955 Hs.109653 ESTs 430 

418884 AA230228 Hs.59197 ESTs 428 

413243 AA769266 Hs.193657 ESTs 426 
431749 AUM9263 Hs.306292 Homo sapiens mRNA; cONA DKFZp564F133 (fr 423 

426054 AI948668 Hs.266619 ESTs 422 

413967 AW204431 Hs.1 17853 ESTs, Weakly similar to 138022 hypotheli 4.22 

433230 AW136134 H&220277 ESTs 422 

421057 T58283 Hs. 10450 Homo sapiens cDNA: FU22063 fis, done H 422 

423578 AW960454 Hs.222830 ESTs 421 

439717 W94472 Hs.59529 ESTs, Moderately similar to ALU1_HUMAN A 420 

443696 AW607444 Hs.134622 ESTs 4.20 

432722 AA830532 Hs.326150 ESTs 4.18 

435756 AI418466 Hs.33665 ESTs 4.14 

428825 A1084336 Ha 123783 ESTs, Weaidy similar to (38022 hypothec 4.14 

439451 AF086270 Ks278554 hsterochrornatiiviike protein 1 4.12 

445943 AW898533 Hs.181574 ESTs 4.12 

450219 AI626999 Hs.224624 ESTs 4.12 

431379 AA504264 Hs.182337 peptidytprolyi isomerase A (cydophi&i 4.11 

432451 AW972771 Hs292471 ESTs, Weakly similar to ALU1J4UMAN ALU S 4.10 

443148 A1034357 Hs211194 ESTs, Weakly similar to ALU8_HUMAN ALU S 4.08 

450177 AI698091 Hs.107845 ESTs 4.08 

420911 U77413 Hs.100293 CWinked N-acetyigtucosamine (GlcNAc) tr 4.06 

421114 AW975051 Hs293156 ESTs, Weakly similar to 178885 serine/th 4.06 

432731 R31178 Hs.257820 fibronectin 1 4.06 

433588 AI056872 Hs.1 33386 ESTs 4.06 

434658 AI624436 Hs.310286 ESTs 4.06 

444040 AF204231 Hs.182982 goJgirh67 4.06 

429512 AA453987 Hs.144802 ESTs 4.06 

443349 AI052572 Hs.269864 ESTs, Weakly similar to ALU1JHUMAN ALU S 4.04 

439867 AA847510 Hs.161292 ESTs 4.04 

425955 T96509 Hs248549 ESTs, Moderately similar to S65657 alpha 4.02 

431393 AW971493 Hs.134269 ESTs, Highly similar to cytokine recepto 4.00 

432125 AW972667 Hs287510 Homo sapiens cDNA FU 12300 fis, done MA 4.00 

435468 AW362803 Hs.166271 ESTs 3.97 

412059 AA317962 Hs249721 ESTs, Moderately similar to PC4259 fern 3.95 

446682 AW205632 Hs211198 ESTs 3.95 

441328 AI982794 Hs.159473 ESTs 3.92 
455778 BE088746 gb^M2^0693-210300-123KiraBT0693Hofno3.90 

438996 AW748336 Hs.168052 KIAA0421 protein 3.86 

418303 AA215701 Hs. 166541 ESTs, Weakly similar to 138022 hypotheli 3.85 

444816 Z48633 Hs283742 H^aptens mRNA tor retrotransposon 3.84 

429355 AW973253 Hs292689 ESTs 3.83 

438578 AA811244 Hs.164168 ESTs 3.83 

432945 AL043683 Hs.8173 hypothetical protein FU10803 3.83 

435318 T97301 Hs.18026 ESTs 3.82 

449941 AW450536 Hs.209260 ESTs 3.60 

424915 R42755 Hs23096 ESTs 3.76 

449987 AW079749 Hs.184719 ESTs, WeaWy similar to ALU1JWMAN ALUS 3.76 

416265 AA177088 Hs.190065 ESTs 3.75 
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413497 BE177661 gb«C1-KTO59B^03O(W)1 1-M2 HT0598 Homo 3.74 



412093 BE242691 Hs.14947 ESTs 3.74 

413822 R08950 Hs.272044 ESTs, Weakly similar to ALU1 J4UMAN ALU S 3.73 

431915 AK000777 Hs.272197 Homo sapiens cDNAFU20770fis, done CO 3.68 

434442 AA737415 Hs.152826 ESTs 3.63 

434959 AW974949 Hs.186564 ESTs, Weakly similar to 138022 hypolheli 3.63 

427704 AW971063 Hs.292882 ESTs 3.62 

426510 AW861225 Hs.194637 BANP homotog, SMAR1 homolog 3.60 

435714 AA699325 Hs.269880 ESTs 3.60 

432598 AI341227 Hs.157106 ESTs 3.57 

438543 AA810141 Hs.192182 ESTs 3.55 

422068 AI807519 Hs.104520 Homo sapiens cONA FU 13694 fis, done PL 3.54 

418259 AA215404 Hs.137289 ESTs 3.54 

428290 AI932995 Hs.183475 Homo sapiens ctone 25061 mRNA sequence 3.49 

419457 AA243146 Hs.209334 ESTs, Moderately similar to S23AJWMANP 3.47 

439312 AA833902 Hs.270745 ESTs 3.47 

408784 AW971350 Hs.63386 ESTs 3.45 

456332 AA228357 gb:nc39d05.r1 NCLCGAPJT2 Homo sapiens 3.45 

424762 AL119442 Hs.183684 eukaryotic translation Initiation factor 3.44 

442884 AI076570 Hs.134053 ESTs 3.44 

421023 AW449855 Hs.96557 Homo sapiens cDNA FU12727 fis, ctone NT 3.43 

434575 Al 133446 Hs.299954 Homo sapiens clone FLB7723 PR02Q55 mRNA, 3.42 

430433 AA476883 Hs^73768 ESTs 3.39 

419317 AA236282 Hs.17231B ESTs 3.38 

448710 TS2926 HsJ04164 ESTs 3.37 

439322 H72245 Hs.188635 ESTs 3.37 

430332 R51790 Hs^39483 Human ctone 23933 mRNA sequence 335 

411755 BE327036 Hs.117494 ESTs 3.33 

427882 AA640987 Hs.193767 ESTs 3.28 

438899 AF085833 Hs.135624 ESTs 3.28 

436535 AW295687 H&254420 ESTs 3.25 

434936 AJ2B5970 Hs. 183817 ESTs 3.22 

451730 AF095687 Hs.26937 brain and nasopharyngeal carcinoma susce 3.18 

447514 AI809314 Hs.208501 ESTs, WeaHy similar to B34087 hypotM 3.18 

413672 BE156536 gb:QWtfT036W101(XM)91-h10 HT0368 Homo 3.16 

435073 AA664078 gbac04a05s1 Stratagene lung (937210) H 3.13 

450295 AI766732 Hs.210628 ESTs 3.13 

419341 N71463 Hs.118888 ESTs, Weakly simSar to ALU1_HUMAN ALU S 3.13 

434495 AW352170 Hs.129086 Homo sapiens cDNA FU12007 fis, ctone HE 3.12 

408113 T82427 Hs.194101 Homo sapiens cDNA: FU20869 fis, done A 3.12 

456437 A1924228 Hs.1 15185 ESTs, Moderately sonflar to PC4259 fern 3.12 

421489 AI922821 Hs.32433 ESTs 3.12 

436090 AI640635 Hs.116468 EST 3.11 

450230 AW016607 Hs.201582 ESTs 3.11 

438011 BE466173 Hs.145696 splicing factor (CC13) 3.09 

418720 AI381687 Hs.39526 ESTs 3.09 

433102 AI343966 Hs.158528 ESTs 3.08 

436150 AW510927 Hs.125243 ESTs 3.05 

440116 AI796851 Hs.283108 hemoglobin, gamma G 3.04 

414900 AW452420 Hs£48678 ESTs 3.04 

435937 AA830893 Hs.119769 ESTs 3.02 

424848 AI263231 Hs.327090 EST 3.02 

435354 AA678267 Hs.1 17115 ESTs 3.00 



234 



WO 02/068677 



PCT/US02/06001 



TABLE 23A 

Table 23A show the accession numbers for those pkeys lacking unigenelD's for tables 1- 
20A, 21A, 22A, and 23A. For each probeset we have listed the gene cluster number from 
which the oligonucleotides were designed. Gene clusters were compiled using sequences 
derived from Genbank ESTs and mRNAs. These sequences were clustered based on 
sequence similarity using Clustering and Alignment Tools (DoubleTwist, Oakland 
California). The Genbank accession numbers for sequences comprising each cluster are 
listed in the "Accession" column. 



Pkey: Unique Eos probeset identifier number 

CAT number Gene duster number 
Accession: Genbank accession numbers 



Pkey CAT Number Accession 



413497 1373771 1 BE177661 H06215 BE144709 BE144829 

413672 1382512.1 BE156536 BE156439 BE156700 BE156449 BE156653 BE155533 BE156524 BE156670 BE156721 BE156723 

417670 1692163.1 R07785 TB5948 TB6972 

418876 179960.1 AA740616 AA654854 AA229923 

419145 182217J N99638 AW973750 AA328271 H90994 AA55B020 AA234435 N59599 R94815 

423974 233842.1 AL118754 AA333202 H38001 

432340 345248 J AA534222 AA632632T81234 

434966 396504 J AAS57494 AI582663 A1581639 

435073 399701.1 AA664078 AW363313 AA8050Q9 

438962 467390.1 BE046594 BE046667 AA828585 AI207343 

455778 1364506.1 BE088746 BE088802 BE088755 BE088876 BE088947 BE088881 BE088952 

456332 179104J AA228357AW841786AW841716 
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TABLE 24: 34 GENES DOWN-REGULATED IN COLON CANCER 
DERIVED LIVER METASTASES COMPARED TO COLON CANCER 
PRIMARY TUMOR SAMPLES CLASSIFIED AS DUKE'S B SURVIVOR 

Table 24 shows 34 genes down-regulated in colon cancer derived liver metastases compared 
to colon cancer primary tumor samples classified as Duke's B stage with a positive survival 
outcome (Duke's B survivor). These were selected from 59680 probesets on the 
Afiymetrix/Eos Hu03 GeneChip array such that the ratio of "average" colon cancer derived 
liver metastases to "average" Duke's B survivor was greater than or equal to 0.25. The 
"average" colon cancer derived liver metastases level was set to the 50th percentile. The 
"average" Duke's B survivor level was set to the 50th percentile. 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

Un'genetD: Unigene number 

Unigene Title: Unigene gene title 

R1 : Genes down liver metastases vs Duke's B survivois 



Pkey ExAccn Unigene ID Unigene Title 



R1 



414522 AW518944 Hs.76325 step II splicing factor SLU7 

416768 AA363733 Hs.1032 regenerating islet-derived 1 alpha {pane 

409153 W03754 Hs£0813 hypothetical protein FU20022 

414555 N98569 Hs.76422 phospnolipase A2, group tlA {ptatE4ets, 

418007 M13509 Hs.83169 matrix metaiioproteinase 1 (interstitial 

424326 NM.014479HS.145296 disintegrin protease 

428934 AF039401 Hs.194659 chloride channel, calcium activated, tarn 

417233 W25005 Hs.24395 small Inducible cytokine subfamily B (Cy 

422260 AA315993 Hs.105484 regenerating gene type IV 

425196 AL037915 Hs.155097 carbonic anhydrase It 

433336 AF017986 Hs.31386 secreted frizzteo^eiated protein 2 

450685 115533 Hs.423 pancreatitis-associated protein 

407811 AW1 90902 Hs.40098 cysteine knot superfemily 1, BMP antagon 

414798 AI286323 Hs.97411 hypothetical protein MGC1 2335 

452852 AKD01972 H&30822 hypoftetlcaJ protein FU11 110 

447513 AW955776 Hs.313500 ESTs, Moderately similar to ALU7_HUMAN A 

423541 AA296922 Ha. 129778 gastrointestinal peptide 

425071 NM_013989Hs.154424 detodinase, ioctoftyronine, type II 

406636 L12064 gb:Homo sapiens (done WR4.12VL) amUh 

421515 Y11339 Hs, 105352 GalNAc alpha-2, 6-slalyltransferase I, I 

428368 BE440042 HsA3326 matrix metaiioproteinase 3 (stromelysln 

414812 X72755 Hs.77367 monokine induced by gamma Interferon 

452594 AU076405 Hs.29981 solute carrier family 26 (sulfate transp 

428227 AA321649 H&2248 small Inducible cytokine subfamily B (Cy 

408741 M73720 Hs.646 carboxypeptidase A3 (mast ceil) 

453064 R40334 Hs.B9463 potassium targe conductance caMunvacti 

431727 AW293464 Hs.162031 ESTs 

433658 L03678 Hs.156110 immunoglobulin kappa constant 

442064 AI422867 Hs.88594 ESTs 

417880 BE241595 Hs.62848 selectin L (lymphocyte adhesion molecule 

430280 AA361258 Hs.237866 tnterteukin 7 receptor 

452877 AI250789 Hs.32478 ESTs 

410310 J02931 Hs.62192 coagulation factor 111 (thromboplastin, 

402408 



0.05 
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TABLE 24B 



Pkey: Unique number corresponding to an Eos probeset 

Ret Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham I. et aL" refers to the publication 

entitled The DNA sequence of human chromosome 22/ Dunham I. et ah, Nature {1399) 402:489495. 
Strand: Indicates DNA strand from which exons were predicted. 

NLposition: indicates nucleotide positions of predicted axons. 



Pkey Ref Strand NLposftion 
402408 9798239 tonus 110325-110491 



4 
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TABLE 25: 

Table 25 depicts Seq ID No., UnigenelD, UnigeneTitle, Pkey, and ExAccn for all of the 
sequences in Table 26. Seq ID No links the nucleic acid and protein sequence information in 
Table 26 to Table 25. 



Pkey: Unique Eos probeset Identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

UnigeneTitle: Unigene gene title 

Seq.ID.No,: Sequence Identification Number found In Table 26 



Pkey ExAccn Unigene© UnigeneTitle Seq ID No. 

426101 AL049987 Homo sapiens mRNA; cDNA DKFZp564F1 12 (fr 14 

419145 N99638 gb 5&6 

426818 AA554827 Hs.340046 DKFZp434A0131 protein 7&8 

421057 T58283 Homo sapiens cDNA 9 

446619 AU076643 Hs.313 secreted phosphoprotBin 1 (osteopontin, 10 & 11 

431958 X63629 Hs.2877 cadherin 3, type 1 , P-cadherin (placenta 12 a 13 

409041 AB033025 HS.50Q81 Hypcmetkal protein, XPJ551 860 (K1AA1 19 14&15 

443162 T49951 Hs.9029 DKFZP434G032 protein 16&17 

436385 BE551618 Hs.144097 ESTs 18-20 

447033 AI357412 Hs.157601 ESTs 2U22 

439608 AW864696 Hs.301732 hypothetical protein MGC5306 23-27 

449032 AA045573 Hs.22900 nuctear factor (erythroiWerived 28&29 

442577 AA292998 Hs.163900 ESTs 30 & 31 

429970 AKD00072 Hs.227059 chloride channel, calcium activated, fam 32 a 33 

424566 M16801 Hs.1790 nuclear receptor subfamily 3, group C, m 34 a 35 

457407 AA505035 Hs.345911 ESTs 36 

430378 Z29572 Hs.2556 tumor necrosis factor receptor superfami 37 a 38 

417332 AW972717 Hs.288462 hypothetical protein FU21511 39 a 40 
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TABLE 25A 

Ptey: Unique Eos probeset identifier number 

CAT number Gene duster number 
Accession: Genbank accession numbers 



Pkey CAT Number Accession 

409041 10962-2 AB033025 AL359061 AL045836 AI751521 A1752804 A1752650 AA853580 AI752290 AA853460 AI752769 M852309 

AA853785 AA853219AW068503 AI7520S9 AL049389 AW068368 BE439518 W52813 BE141B33 A1940574 AT750606 AL109718 
AA242845 AA315795 AA307741 AW9546Q3 AI752070 AA350794 AI752649 AA307755 AW951677 AA298896 BE439692 
AAB52453 AW068826 AW853984 AA418236 AA639417 AW290917 AI750592 AI752768 AL045837 AI926513 AW262903 
BE439819 A1459360 AW339074 AW295181 AW029483 AI750945 AI750659 AI752525 AI147688 BE440122 AI751522 AW73816 
AI752291 A1694639 AI92581 6 AA599476 AA242752 AW021892 AI755098 AW469299 AW769363 AA853579 AI784082 
AA852454 AI925501 AA976657 AW150473 AW166734 

417332 166755 1 AW972717 AA523805 AI962905 AI373245 AW235545 AI812045 AW589434 AI826824 AW572339 AI377551 AA195718 
AI868470 

419145 182217 1 N99638 AW973750 AA328271 H90994 AA558020 AA234435 N59599 R94815 
421057 196849 1 T58283AA765038AA283052 H99396AA814751 AI032674 N81016 N81017 BE222349 AA830545 
424566 2408.1 M16801 NMJJ00901 057171 AL04132BAF068623 A1201 179 AA151766 AA568349 AI698649 AI692765 BE327401 AA744953 
AA744951 AW361986 AV651840 T29894 AW945146 AW945145 W24096 AI183952 AI458972 AW190993 AI765359 AI634663 
AI741201 AW418944 AI767551 AA679687 AW772342 AW629508 BE504300 AI251790 AI522294 AA724341 AW615402 
A1537570 AA470665 AI458375 AW768901 AA447079 T23537 AI783744 R44301 D56621 N91919 AA149749 
426101 26088 1 AL049987 AW362842 T78981 AA247541 AI217018 AW961515 AA632986 AA663108 BE326465 AW872412 AIQ24689 AA453725 
BE150456 AA229448 AA442638 AA442648 AI916737 AA460220 AA868553 AI827987 AI0Q5467 R31 132 AI742087 AA442379 
N56349 AW769479 AI860142 AI917507 AA813604 AI860141 AM59289 AA522837 AI354470 AI921333 BE466760 AW971 193 
AW103830 AW277065 AW020895 Al 187977 N28268 A1084517 R95914 AA833517 AA563934 AA437299 AA436880 AA447794 
AA812876 AA653178 R31089 AI472712 R64648 AA600372 AA229164 AA703066 AW270324 AI1 91725 AA551512 AA493776 
426818 272427.1 AA554827 AA701001 AW972954 AL039129 AA385540 AA911663 
429970 31134 1 AKD00072AW840683AW843764AW84^ 
AK000138 

430378 3170J Z29572 AW976377 AA286871 AA633372 AA987627 AA743176 AI865358 AJ0C6884 AF031845 Z14955 
431958 3394 J X63629 NMJ01793 BE175433 BE153414 BE153425 AW364593 BE315317 AW950190 AA314252 BE142943 AW365220 
AW368405 BED 04269 AW366568 AL040609 AI829273 AI591 168 BE146183 AJ631 060 AI830793 W78081 W92295 AI927422 
BE009313 AI371793AW993031 AI204659 AA535113 AW993030 AI190281 AA555159 AW269637 AW993146 AH 49268 
AA425217 AW4731 94 AI890930 AA55 1 993 AI9521 06 W92308 A1827275 W45400 A1952328 AW609233 AA77461 1 AA551 779 
AI91 3967 AI798658 AI537658 AW517535 AA632236 AW339148 AW589522 AA836945 AA961263 AW015821 AW272946 
C00249 W4O333BE143121 
436385 418907.1 BH551618 A1207338 BE220568 A1261568 AW841737 AA714722AA946891 AI033239 

439608 47438.3 AW864696 AW338889 AJ342866 AA084522 AI244150 AI610339 AA425635 AA764930 AA976965 AW805766 AA057765 

AW805B45 AW802595 AA262971 AI969620 N75323 BE549060 AWB05725 AA025B09 N80776 N64595 AW073372 AA025493 
AI81 9475 AW028879 AW189496 AA442907 AW410368 AI91 1629 N71276 AW316922 AW805838 AA043880 AW189184 
AA449756 AA748153 AA705608 AI910643 AA279492 BE1601 19 AW805761 AA026262 AA782207 AW057652 AWB05768 
H21998AW194254AW275178AA44904OAA279582 N76314N54348 

442577 54549.1 AA292998AW238350 AI676W9AVVW4092BE566458AVV078677AW514801 AW073701 AW1706M 

AI923975 AC93326 AI700229 AW450814 AW628452 AI671457 AA937534 AI889694 AW339423 AW291875 AA551874 
AI682314AI926227AA397375 

443162 5613.1 T49951 AA025326 H04839 AA393303 R63101 W57657 W25628 AI961431 R71 165 N39940 H01548 K01759 AA641624 

AI634930 AA595296 AW994770 AW994747 BE047247 W38159 AA8581 33 AI701944 AW386273 AA676625 R24676 R79410 
AA922863 AI151319 H01013 AA024482 W02574 H01456 A1150858 AW1 35972 AW631 167 AI270332 H04750 T49622 AA004543 
R63061 AI093086 A1247539 H01225 H03388 AW472933 AA382448 AI219287 N27194 AW389613 AA649738 AW994764 
AW389614 R25176 AA897262 R71626 AA909471 R71240 AW811917 R76109 AI202312 AI866010 R76162AL1 17538 R79411 
T58656AW994674 

446619 685 1 AU076643 AA594604 AA346866 R18197 AA345192 AA337773 AA089791 R84435 AA337838 AW392167 AA075190 D55416 

AW1 50360 AW366257 AA579816 H93046 AW385689 AW385697 AM 86216 AW581 197 AL037509 AB019562 AA232626 R97905 
AW368019 AA242891 AW888502AI798331 AW385635 AW581221 T96947 H87989 AA369511 AA075191 R80742 AA366406 
W92752 H45586 AI864016 AW888497 BE0O4992 AI3841 10 AI624256 AI627593 W92728 AI682719 AA948208 AA171734 
N40517 J04765 AA379957 AA362403 NM000582 AF052124 AA300290 AA333447 AA343721 AW889543 BE565767 R76601 
R18015 AA100531 AA489963 AA101296 AA363513 AA344088AA336750 T77505 D56440AL1 10351 AL110331 F12195 R20175 
AA336664 H17766 AA363538 AA363590 D28760 AW578517 AA363531 A1614667 AA846699 AA366253 AW951285 AA297992 
AA327756 AW361609 AW815455 AW815427 AW815428 D54182 AW852200 AA171630 W27018 AW815864 AW379995 
AW378222 AW362610 BE566022 AW021023 C17352 D58435 AA345409 AI623991 AW020967 AI924770 AI799443 AW946393 
AA991239 AI571617 A1935181 A1923999 AI026895 AI860319 AW189873 AW270353 AW023584 AI81381 1 R99929 AW339056 
AA913152 AI636352 AI829394 AW151077 AW192580 AI5701 19 A1086391 AW021764 AW519154 A1375193 AW268678 
BE465690 AW019983 AW268654 AI573138 A1141809 AJ954553 A1559242 AA568945 AA886417 AW338527 A1635861 
BE465666 AI921239 AA968537 AI956027 AA91 1981 AIB27661 AW511046 BE6197B0 AI922227 AI811870 AW190131 
AW129220 AW51290B A1290757 A1819088 AI623771 AA775616 BE349419 AI126375 H88773 AI241758 AW275157 AB37848 
AI613425 AI631387AA922631 A1273463 AJ982898AW168957AI446481 BE501588BE048264AI499922AWQ23812BE220523 
AW973846 BE349276 AH41091 AA976060 AW973845 AA101270 AI582472 AW613675 A11 39360 AI282627 A1276044 N22345 
A1261875 AA634136 AI824468 AW887693 N27107 R21504 AI042223 N22067 AW196871 A158101 9 BE004973 AA252035 
N22087 AA570717 H1 1250 A1B04026 AA368098 AA021512 H08842 N26275 AA176368 AI758758 AA570371 AA232574 
BE221177 AW190221 AW471386 M78225 AM22140 AI624521 AA719775AA300291 AA568557 AI871430 BE465630 N71862 
T72587 W92721 H88774 D543B3AW1Q3693AW089986 AI382689 R42363 R44962T98770AA357374 AW022O74AI356207 
T29241 AW089431 AI933875 N66267 N67352 AA121786 AA363910 F09824 T95618 N66888 R80550 AI280667 AW196719 
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R59299 AW021049 K73469 AI95431 1 BE439454 AW079450 AW873850 AA348338 AW896006 AW268145 AA853631 H17650 
R39537 N66873 N67240 H06298 AI784199 R44260 AA9041 18 AA911756 F04S44 AA807809 AA665210 AI698448 T29719 
AA837240T64844 H08926 
447033 704603 J AI357412AI870708AI590539W07459 

449032 7945 1 AA045573 AA279920 R20139 AA372783 AW953529 H21473 R7831B W74359 AA022505 AA369091 AW084075 AA503638 

AV660815 AI216262 AA779843 BE219825 AF125534 AW972129 AI919099 AIB21283 AI300590 AI953701 AA331415 AW610546 
AW793050 AI953679 AW793047 AW610543 AI671 103 AW292105 AW0241 12 R77947 W76339 AA3051 1 1 M132523 AA227467 
H21401 AW366572 AW024129 AI701866 M654744 BEO42803 AI347173 AW866053 AW662710 R36639 AM69777 AA962733 
AI865366 AA501998 AW866054 BE178974 

457407 333252.1 AA505035 AW235098 AI634028 
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Table 26 

Seq ID NO: 1 DNA sequence 

Nucleic Acid Accession ft see Table 25 & 25 A for complete list 



1 II 21 31 41 51 

I I I I I I 

CAATATAGTA CAATAACTAT TTGCATOACA TTTACATCGQ ATATTATGAG TGATCTAGAG 60 
TTGATATGAA GTATATGGGA GGATGTGCAA AGGTGATGTG CAAATACTAT GTCATTTTAT 120 
AGGGGGGACT TGAGTATCCT TTGTT ACCCT C AGO AGATCC TGAAAGCAGT CCCCCATGGA 180 
TACTGAGGGC TGACTGTATA GTCCTATCCT CACGGAACTT TCATTCTAAT GGGGGAAGAC 240 
TGACTATAAA CAAAATATAT GTAATAGGTG GTGGTAAGTA CCGTGGAGAA GTAACAAATG 300 
GGGCAAAGTG AGTTATACAG CTOCATTCTT AGAAACCTTG GAGTACTTTT CTTAGTTTAT 360 
ACTCGTGGTG GTTTCCTTTT GTCTCCnTA TTACATGGGA CTCTGACATG TGCCCATAGC 420 
TAGGGTGACA GTAGGATCTA CCOGATAGTA GGGTGGCAGT AGGATCTACC CAAAAAGCGT 480 
CCTGCTGATA CAGGACCAAA GCATCCTGTT GTTCTCGAGC CTATAAAAAG AGCTAATGGT 540 
GTTGCTTCTC TTAACTGTGG CCTCCTACAC TGTGTTTTGG ATGATTGGTG ATGTCTTGGA 600 
TATTCTGTTT CTTTGGAACT TTGAATATAC AACACTTTAC TAGGGAATTA GCAATGGAAG 660 
CAGAGCAAAG ATGTACAGAG GAAACAATGC GTAACTCTGA TGGAATTGAA GTCATGAGGC 720 
AGCAGAGAGC TTAAATTACA GCTTTAAAAA TTTTTATTTT TTAGAGGGAA TTTACTTGGG 780 
AGTAACAGCA GTAATAGTTA ACGG AGCCAG AATGCTTG AG TCATATAATT GCAAAGC AG A 840 
GTTGGGAGCA ACAGATGCTA AAGAGTAGTT GCTGTAGTTC CTCTTTGGGT CGTAGGAGCA 900 
GTTGTCATAT TACTATATAG CTACTGCATG AAGAAGAGTT CTTAGTGAGG CCTGGGTGAA 960 
CAGCTCTTCT TAGTATTCTG TGTGACCCCA TTTGACCnT TAACAAATCC CTAAGTAAAT 1020 
AAATAGCCCC TCAGG AAAAC TAAOTTTTTC TCTGCTGTTT TTTTGCTTGA GAG AGCTATA 1 080 
ACTG TAATAG ACTTATATTT CTGAACATTT TAGTGCTTGC CAATA TTTGG TAATATTTAT 1 140 
GTTTCCTATA TTTGTAATGA ACATTCTTCT TCCGGTACAT Ti'i J i'G JTAA ATTATTGTTT 1200 
GATGGATAAA AGTTCACCTT TTATTGTATA AAATTGACTG AGATTAATTT ATACACATTG 1260 
ACAATGGGTA AATAGAATTT TTCAGATTAT TAAAAGCTGA AGGATGACCA CGTAAGCAAA 1320 
AAAAAAAAAA AAAAAACCAA CAAAAATAAA CCCAAACCCC TCAAACAATT TCGAACACGA 1380 
AACATTCTTC TGATGCCGGC ATCOCTGCTT GCAGGTGTGA AGGGGGCAGG AATCAGCGAG 1440 
GTGTCCTGGG CTGAGTCCCC GGGGAAGAAT ATGAT 

Sea ID NO. 2 DNA .sequence 
Nucleic Acid Accession #: X83301.1 

1 11 21 31 41 51 

I 1 I I I I 

GCAAAGCCAG CTGGGCTCCT GAGTCCGGTG GGTACTTGGA GAACTTACTA CGTCTAGCTG 60 
GAGGATTGTA AATGCACCAA TCAGCATGCT GTGTCTAGCT CAAGATTTTC TCCATCCCCT 120 
TATTTTGGGC CAGTGGCTGT CATTACATAT GAGATGAGTC TCTTGAAGAC TACAG ATGAA 1 80 
CTCAAGCTCC ATGAGGAGAT GTTTCATTGT CGAGAGCAGT CATGATGGCC TGCACTCCAC 240 
ACAATGCAAC AGAGTGAAAG AGCAGGTTCT GCTTCTTTGG TGTAGTCCTG AAGCTTCCTA 300 
AGAAACTTCA CATCAGGTGA TGGATAGG AG CAACCCTGTA AAACCAGCCT TAGACTATTT 360 
TTCAAACAGG CTGGTGAATT ACCAGATCTC CGTCAAGTGC AGTAACCAGT TCAAGTTGGA 420 
AGTGTGTCTT TTGAATGCAG AGAACAAAGT CGTGGACAAC CAGGCTGGGA CCCAGGGCCA 480 
GCTGAAGGTG CTGGGTGCCA ACCTCTGGTG GCCGTACCTG ATGCACGAAC ACCCCGCCTA 540 
CCTGTACTCC TGGGAGGATG GTGATTGCTC ACACCAAAGC CTTGGACCCC TCCCAGCCTG 600 
TGACCTTTGG GACCAACTCC ACCTACGCAG CAGACAAGGG GGCTCTGTAT GTGGATGTGA 660 
TCCGTGTGAA CAGCTACTAC TCTTGGTATC GCAACTACGG GCACCTGGAG TTGATTCGGC 720 
TGCAGCTGGC CGCCCAGTTT GAGAATTGGT GTGAGACATC ACAATOCCAT TATTCAGAGC 780 
GCGTATGGAG TGGAAACGCT TGTAGGGTTT CACCAGGGCT GGTGAATTAC CAGATCTCCG 840 
TCAAGTGCAG TAACCAGTTC AAGTTGGAAG TATGTCTTTT GAATGCAGAA AACAAAGTCG 900 
TGGACAACCA GGCTGGGACC CAGGGCCAGC TGAAGGTGCT GGTGCCAACC TCTGGTGGCC 960 
GTACCTGATG CACGAACACC CCGCCTACCT GTACTCGTGG GAGGATGGTG ATTGCTCACA 1020 
CCAAAGCCTT GGACCCCTCC CAGCCTGTGA CCTTTGGGAC CAACTCCACC TAOGCAGCAG 1080 
ACAAGGGGGC TCTGTATGTG GATQTGATCC GTGTGAACAG CTACTACTCT TGGTATCGCA 1 140 
ACTACGGGCA CCTGGAGTTG ATTOGGCTGC AGGCCCTGCA GCTGGCCGCC CAGTTTGTGA 1200 
ATTGGTGTAA GACATCACAA TCCCATTATT CAGAGCGCGT ATGGAGTGGA AACGCTTGTA 1260 
GGGTTTCACC AGTCTTTCCC AGGGAACTCC GATGAAGTGT TCCAACAAAA TGAGCGAGTG 1320 
AACCAAGAAG AGGATGACAT TAGATCCAGG AGATACAACA GAGGAGATAA TCTCCAGGAT 1380 
GCCTGTGAAG AAAGATOCCT GGATCCCAGG ATGATTATAG GACAAGTTGT TCATAATOCA 1440 
GCAGGCCAGA AGACTTCCAG GGAAACTCAT TTCAAGATGA AAATGGACCA GCCGCAGTGG 1500 
CTCACGCCTG TAATACCAGC ACTTTGGGAG GCTGAGGCGG GCGGATCACT TGAGGTCAAG 1560 
AGTTTGAAAC TAGCCTGGCC AACGTGGCAA AACTCCATCT CTATTAAAG A TACAAAAATT 1620 
AGCCAGGCAT AGTGGTGCAT GCCTGTAGTC CCAGCTACTT GGGATGCTGA GGCAGGAAGA 1680 
ATTGCTTGAA CCTGGGAGGC AGAGTCTGCG GTGACCGAGA TCATGCCACT GCACTCCAGC 1740 
CTGGGTGACA G AGCCAG ACT CCGTCTCTAC TAAAAAAAAA AAAAAAAAAA AAA 

Seq ID NO; 3 protein sequence: 
Protein Accession #: CAA58280.1 

1 II 21 31 41 51 

I I I I I I 

MDRSNPVKPA LDYFSNRLVN YQISVKCSNQ FKLEVCLLNA ENKWDNQ AG TQGQLKVLG A 60 
NLWWPYLMHE HPAYLYSWED GDCSHQSLGP LPACDLWDQL HLRSRQGGSV CGCDPCEQLL 120 
LLVSQLRAPG VDSAAAGRPV 

Seq ID NO; 4 DNA sequence 
Nucleic Add Accession #: BCO 02622.1 

1 11 21 31 41 51 

I I I I I I 

QGCACGAGGC TCCGCCOGCG GCCGGGATGC ACTAGGCAAA GCCAGCTGGG CrcCTGAGTC 60 
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CGGTGGGTAC TTGGAGAACT TACTACGTCT AGCTGGAQGA TTGTAAATGC ACCAATCAGC 120 
ATGCTGTGTC TAGCTCAAGA TTTTCTCCAT CCCCTTATTT TGGGCCAGTG GCTGTCATTA 180 
CATATGAGAA CTCAAGCTCC ATGAGGAGAT GTTTCATTGT CGAGAGCAGT CATGATGGCC 240 
TGCACTCCAC ACAATGCAAC AGAGTOAAAG AGCAGGTTCT GCTTCTTTGG TGTAGTCCTG 300 
AAGCTTCCTA AGAAACTTCA CATCAGGTGA TGGATAGGAG CAACCCTGTA AAACCAGCCT 360 
TAGACTATTT TTCAAACAGG CTGGTQAATT ACCAGATCTC CGTCAAGTGC AGTAACCAGT 420 
TCAAGTTGGA AGTGTGTCTT TTGAATGCAG AAAACAAAGT CGTGGACAAC CAGGCTGGGA 480 
CCCAGGGCCA GCTGAAGGTG CTGGGTGCCA ACCTCTGGTG GCCGTACCTG ATGCACGAAC 540 
ACCCCGCCTA CCTGTACTCG TGGGAGGATG GTGATTGCTC ACACCAAAGC CTTGGACCCC 600 
TCCCAGOCTG TGACCTTTGT GACCAACTCC ACCTACGCAG CAGACAAGGG GGCTCTGTAT 660 
GTGGATGTGA TCCGTQTGAA CAGCTACTAC TCTTGGTATC GCAACTACGG GCACCTGGAG 720 
TTGATTCAGC TGCAGCTGGC CGCCCAGTTT GAGAATTGGT GTAAGACATC ACAATCCCAT 780 
TATTCAGAGC GCGTATGGAG TGGAAACGCT TGTAGGGTTT CACCAGTCTT TCCCAGGGAA 840 
CTCCGATGAA GTGTTCCAAC AAAATGAGCG AGTGAACCAA GAAGAGGATG ACATTAGATC 900 
CAGGAGATAC AACAGAGGAG ATAATCTCCA GGATGCCTGT GAAGAAAGAT CCCTGGATCC 960 
CAGGATGATT ATAGGACAAG TTGTTCATAA TCCAGCAGGC CAGAAGACTT CCAGGGAAAC 1020 
TCATTCAAGG AGGTGAAAAT GATGGATGAC TCCTCCAAGA TGAAAATGGA CCAGCCGCAG 1080 
TGGCTCACGC CTGTAATAOC AGCACTTTGG GAGGCTGAGG CAGGCGGATC ACTTGAGGTC 1 1 40 
AGGAGTTTGA AACTAGCCTG GCCAACGTGG C AAAACTCCA TCTCTATTAA AAATACAAAA 1200 
ATTAGCCAAG CATAGTGGTG CATGCCTGTA GTCCCAGCTA CTTGGGATGC TGAGGCAGGA 1260 
AGAATTGCTT GAACCTGGGA GGCAGAGTCT ACAGTGAGCC GAGATCATGC CACTGCACTC 1320 
CAGCCTGGGC AACACAGTGA GACTCCATCT CAAAAAAAAA AAAAAAAAAA AA 

Seq ID NO: 5 Protein sequence; 
Protein Accession ft AAH02622.1 

1 11 21 31 41 51 

I i I I I ! 

MDRSNPVKPA LDYFSNRLVN YQISVKCSNQ FKLBVCLLNA HNKWDNQAG TQGQLKVLGA 60 
NLWWPYLMHE HPAYLYSWED GDCSHQSLGP LPACDLCDQL HLRSRQGGSV CGCDPCEQLL 120 
LLVSQLRAPG VDSAAAGRPV 

Seq ID NO: 6 DNA sequence 

Nucleic Acid Accession ft see Table 25 & 25 A for complete list 



1 11 21 31 41 51 

I ! I I I ! 

ACCTGAGATC AGGAGTTCGA GATCAGCCTG AOCAATAGGG TGAAACCCCG TCTCTACTAA 60 
AAATACAAAA AATTAGCTGG ACACGATGGT GGGTGCCTGT GGTCCCGGCT ACTCGGGAGG 120 
CTGAGACAGG AGAATCAGTT GACCTGGGAG TTGGTGGTTG CAGTGAGCTG AGATCACACC 180 
ATTGCATTCC AAGCCTGGGC AACAAGAGTG AAACTCCATC GCAAAAAAAA AAAAGAAGGG 240 
GCATAATTTG TGGATGAGGA TTGGATATAA GOTAAAGGAT GGGACATTCT TGGACTTACA 300 
GATGGTGTGA TTGCCTGGCT AGAAGAAGAA TTCCCGGTCA AAAAGAAACC ATCAGCTTTC 360 
CAAGTGTG AA AG AG AG ATAA ATCTGTGAAG ATTATAGGGA CTACAGGAAA CTTAATCTTT 420 
TTCTTTGAAA AAGCAATTGT AGCAAAAAAA AAGAAAATTT CTTACTGTCA TCTAAAATTG 480 
ACATGGACAT CTTAGTGGAC TAGAAGTTAA GGGCATAAAT TCTCCCAGTG ATTTTTAATT 540 
TTAGCATTGT GATTAACACC TTCTAAAATT GCCAGAACTT AATAAATAAT TGCTTTTCAT 600 
TATTAGTATG CCATCAAATT TAGTAGCTGT TTCAGGCTTT AATGTGTCAA GCCTAAAATC 660 
CAGATTTTTG AGGATCTTCT CCCTCTTAAA AGAGTATTCA GTTAACTGCC GTAGAAATAC 720 
ACATGTATAC AAGGGCACTG TATACATCAG TCTAAAAAAT AAAAATATGT ATACGTTCTG 780 
GTGAGTCTAG CACAGCATTG CCCAATAGAA ATACCAATGG AGGTCACAAA TGTGGCCCAT 840 
ATAGGTTAAT TGGTAAATTT TCTNATAGNC ACC 

Sag ID NO: 7 DNA sequence 

Nucleic Arid Accession ft AK000942 

Coding sequence: 1204-1503 



1 11 21 31 41 51 
1 I I I I I 

GTAAAGGAAT GTCTTTTTAA TTCAGCTTTT CTTTTCTCCA TGCTAGTGTT ATCAGGTTTT 60 
GGTATTTATT TACTTACAGC ATATGTTATG AAGCTGGTTT GAAAATTGGT TTTAGATATA 120 
TCTGCAAGTT TACTACTTTG ACTGTAAAAA AAAAAAATGA AAAAGTAGTT GACATCTGTC 1 80 
CTCAGAAGAA GTTTGCAGGT TGCATATTTG TGTGTAAATA CACAGGCTAA AAGGTAATTT 240 
ATGTTCCTTG GGAATTG AAA TGGTCAGTGG CCCGTTACAG AAACTTATCA GTCATATATC 300 
AGCACCAGTT C^TTCTTTTG CACCTTAGGG ACCATCTGTC CCCTGAGGTG ACCTGAGAAA 360 
CAACCAGTTG CCCACAGACT GTTATTTCTT CAAGTG AGCC AGG ATTTG AT TTCACTGCCT 420 
TATATTCTAT TTTTAGTGTA CAGTGCTTTG ATTTTTTGG A AAAACTAAAT TTTAAACAT A 480 
-TTTGAAAAAT GTTATAAGAC TTGGACATTA AGTCTGTTGA TAGCCAAAGT CAGTTTACCA 540 
AAGTAAAACA AATAAATTCT ATGCTTCTTC ATTGTCAAAG AGCAGTCTGC CATCATGTGG 600 
ATATAAATGG ACTATGTAAA GTGACATGGT GCTT ACTCTC TACCTAATAA TAGCCTCCCT 660 
CCTGTTCCAA CAAGATAACC AACAGGTATA TTTAATTTAC CAGTTAATAT GTTTTGGATA 720 
ATTOGCTGCC TTGAAATGCT ATATGTTTTA TAGTACATCA TAGCTTTAGT 1 1 rCTTC ATA 780 
AGGAAATTAC AGTTACATCC TGGCTAACAT GGTGAAACTC CATCTCTACT AAAAATACAA 840 
AAAATTAGCC GGGCGTGGTG GCGGGCACTT GTAGTCCCAG CTACTCGGGA GGCTGAGGCA 900 
GGAGAATGGC GTGAACCCAG GAGGCGGAGG TTGCAGTGAG CCGAGATCGT GCCACTGTAC 960 
TCTGGCCTGG GAGACAGAGC G AGACTCCAT CTCAAAAAAA AAAAAAAAAA AAAAAAAAGA 1020 
GAGAGAGAGA CCTGGAGTAG AGATTCTGTC AAAG AACTTT TTCTTTCTTG AGAAGCATCT 1080 
GAAATGGAAT CTGTTGTCTC TTCGAAATAT GTACTGCTGT AACAGTGAAA CAACCCTCAG 1140 
AGTATGCCTT CGTGTGGGCT ACTCGTTGTG GTTTTGAACT TGGGGGAACT GTCTGTGTTT 1200 
GGGTCAAGAA TATGCAACTG GCTGGGCACA TTGGCTCACG CCTGTAATCC CAGCAATTTG 1260 
GGAGGCTGAG GCAGGCGGAT CACCTGAGGT CAGGGCTTCA AGACCAGACT GGCCAACATG 1320 
GTGAAACCCC GTCTCTACTG AAAATACAAA AATTAGCTGG GCATGGTGGC AGGTGCCTGT 1380 
AATCCCAGCT ACTCGGGAGG CTGACGTGAG AGAATCGCTT GAACCCGGGA GTTGGAGGTT 1440 
GCAGTG AGCC GAG ATTGCAC CATTGCACTC CAGCTTGGGC AACAAGAGTG AAACTCTTGT 1 S00 
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CTCAG 

Seq ID NO: 8 DNA sequence 

Nucleic Acid Accession #: see Table 25 & 25 A for complete list 



1 11 21 31 41 51 
I I I I I I 

GACTAGGCTG GGCAACATAG TG AG ACCTCA TCTCTAAAAT TAAAAAAATA AAAGCCACCA 60 
GAAAAAAACC TAAAAACATG CCAAGTGACA TCAGTCTTTG ATGAAAATGG CAGCAGAAGA 120 
GTGATGCCAT GGGTGGGGGT GGGAAATGCT ATTTCAGCAG AGAGGGAGCT GTCATGGAAG 180 
ACACCATGTG GCTGGGCACG GTGGCTCACA CCTGTAATCC CAGCACTTTG GGAGATAGAG 240 
GCAGGTGGAT CCCTTGAGCT TAGGAATTTG AGACTAGCCT GGGCAATAAG AGTGAAACTC 300 
CATCTC AAAA AAAAAAAAAA AAAAAGGTGC ATGAAACATA TGAAGCAAAA AGTGAAAGTC 360 
CCCATTCTTT TCCTTTTTCC AG AGGTG ATT TTTGTGGCCA ATCTGGTTTC ATTCCCTCCC 420 
AGACACTTTT CTAGGCATCT ATGCGCCTCT ATTCACATAT AAACAAAATA GGAGTTTTCC 480 
TGTGCTTCCC TTAAATGGCA TATGTATCTT TCACTCl'lTl' TTTTCACCTA GTGGATCTTT 540 
AATACCTTAA AAGCTCAACC TGGGCTTGGT GCGGTGGCTC ATACGTGTAA TCCCAGGCCT 600 
TTGGGAGGCC AAGGTGGGAG GATCACTTGA GCTCAGGAGT TCCAGACCAT TCCAAAGCAA 660 
AAACAAAAGG ATTTTGAGAT CAGTGTGGGC AACTTAGCAA AACACCATCT CTTAAAAAAA 720 
AAAAAAAAAA 

See ID NO: 9 DNA sequence 

Nucleic Acid Accession fc BC010433. 1 

Coding sequence: 3-335 

1 11 21 31 41 51 

I I I 1 t I 

GGTCGCCCTC CGTCGTGGTC TGGCGTGTAT TCCGAGCCTT GGTGTCTGGC GGTTTCCGAG 60 
CGTTGGTGTC TGGCGGTTTC CGAGCGTTGG TGTCTGGCGG TTTCCGACCG TTGGTGTCTG 120 
GCGGTTTOCG ACCGTTGGTG TCTGGCACGC GCC ACCCTCT CrTGCTTTGG TTGCGCC ATG 1 80 
CCGATGTACC AGACAAG AAG ACAAGAAAAT GATTTGAGGA CAGCTTCAAT CGCGGTGTGA 240 
AGAAGAAAGC AGCAAAACGA CCACTGAAAA CAACGCCGGT GGCAAAATAT CCAAAGAAAG 300 
GGTCCCAAGC GGTACATCGT CATAGCCGGA AACAGTCAGA GCCACCAGCC AATGATCTTT 360 
TCAATGCTGC GAAAGCTGCC AAAAGTGACA TGCAGCACCG AGAAGTCCGC GTGAAGTGCG 420 
TGAAGGCTCT GAAAGGGCTG TACGGTAACC GGGACCTGAC CGCACGCCTG GAGCTCTTCA 480 
CTGGCCGCTT CAAGGACTGG ATGGTTTCCA TGATCATGGA CAGAGAGTAC AGTGTGGCAG 540 
TGGAGGCCGT CAGATTACTG ATACTTATCC TTAAGAACAT GGAAGGGGTG CTGATGGACG 600 
TGGACTGTGA GAGCGTCTAC CCCATTGTGT AGGCCTCTAA TTGAGGCCTG GCCTCTGCTG 660 
TGGGTGAATT TCTGTACTGG AAACTTTTCT ACCCTGAGTG CGAGATAAGA ACGATGGGTG 720 
GAAGAGAGCA ACGCCAGAGC CCAGGTGCCC AGAGGACTTT CTTCCAGCTT CTGCTGTCCT 780 
TCTTTGTGGA GAGCAAGCTC CACGACCACG CTGCTTACTT AGTAGACAAC CTGTGGGACT 840 
GTGCAGGGAC TCAGCTGAAG GACTGGGAGG GTCTGACAAG CCTGCTGCTO GAGAAGGACC 900 
AG AGCACGTG CCACATCG AG CCAGGGCCAG GGACCTTCCA CCTOCTAGGG TGAAAOCAGG 960 
AGAGATTGCT TGCTTCACTT GTACAAGGCA GGAACGGTGG CATGGGGTGG GGGAAACTTG 1020 
GAGTTGG AAG GTGGCTAATC TTTG ATTCTA TGTTTTTGAT CCTCCTGGCA CTCCAGACCT 1080 
GGGTGATGTG CAGGAGAGCA CACTGATAGA AATCCTTGTG TCCAGTGCCC AGCAACTCCT 1 140 
GCCTCAGCCT CCCGAGCAGC TGGGACTACA GGCGCCCGCC ACCACGCCTG GCTAACTTTT 1200 
TTGTGTTTTT AGTAG AG ACG GGTTTTCACC GTGTTGGCCA GGATGGTCTT G ATCTCTTGA 1260 
CCTTGTGATC CACCTGCCTC ATCATCCCAA AGTGCTGGGA TTACAGGCGT GAGCCACTGC 1320 
GCCCAGCATG TTAGACAATT TTTAATTCAT CCTCTCTGTG CTGTTGTTTT CTCAGCTGTG 1380 
AAAGGAATAT TCTGGTGGGG ACAAGGTTAC AGAGTTGCTG AGAGGGTCTC ATGACATGAA 1440 
GGTACTGGCC TTGGCACAGT GCCTGGGGGG GCGGGGACTC CGCACATGCC TGTG ATGTCA 1500 
CAGTTACTGT CAGTTCACAG CX3AACCTTCC CTCCTTTTCC TGTTGACTTT CCCACACTCC 1560 
TGTAACCCTC CCTCCCTCCC TTCTTCCTCT CTCTCTCTCT CACTCACGCA CACGCACACA 1620 
CACACACACA CACACACACA CACACACTCC ATTCACTOTC TCCATGACTC TGGAGTAAAC 1 680 
TAACGTCTCG AGTTGCCATT GGAAGCCCCG TTGTCCTCAT TTAGACTTTC ATGGGTTATA 1740 
GGCACTTTTG ACTTCCTGGG GTCCTTCTTC AGTTAAAAAA AAAAATTAGA AAATTAGGCC 1 800 
GGGCGTGGTG GCACATGCCT GTAATCCCAG CACCTTGGOC TCCCAAAGTG CTGGGATTAC 1860 
AGGAGTGAGC CACCATGCCC AGCCTCCGTT GTCCTCATTT AGACTTTCAT GGGTTATAGG 1920 
CACTTTTGAC TTCCTGGGGT CCTTCTTCAO TTAAAAAAAA AAAAAAAAAA 



Seq ID NO: 10 DNA sequence 

Nucleic Acid Accession ft see Table 25 & 25A for complete list 



1 11 21 31 41 51 
I I I I I I 

AGTGGNTCCC CCGGNCTGCA GGAATTCGGC ACGAGATCAT GATGGCTAAT ATTTCCTGAG 60 
CACCTTTCAT TCAGGCATGA TGCCAGGTGC ACCAACTTAC TTAATCCTCA TAGCCACCAC 120 
CTGAGCAAGC TCCTGTTTTA TAAATGGACC AGTTCTTGTT GCTGTTGTAC AAGTTATTTT 1 80 
CTTTCTATAA CGTCCTCCTT GTCCTCCTTC CACATTCTTA AAGAAACTTT CCCTTCCTTT 240 
AAAGTACTCA GGGAGCCCTG CATTGCTTCT TGAAGCCTTC TCCAGCTTCA TCATCTCACA 300 
GTGGTCTCTC TTTTCACTAA ATGTCCAATA TGCTGCACAT AAGTACCCCA AAGTTAGCAC 360 
AGGAATTGTT CCATGGCTGT CATATATGTT AAAAATCATT AAAAGTTCAT TTTTTCTCTC 420 
ATTATGGGAA GGATACATGC TCCTACTAGT AAATTTAGTA GGTAGAAAAA AATTATCACT 480 
ATCTAGACTG CTTTCCATTT AQTCTTTATG CATAGCTTTC GTGTCTGCCT ATTTTTACCT 540 
TGTGTTTGTA ACTTACTATT ATAAAATATG CGTCTCTATG TTCATTGTCA ACGATTATTT 600 
ACAATAACAT GGAGTGGATT T ACATGTATT CTCTATATTT GGATTAAAGG AG AT AG AGTA 660 
TGTG AAATTA AATGGGAGAA GTATCTGATA CATAACAGGC AATACAAATA TTATC ACATA 720 
GCGTCAATTT ATTTGTGAAT ATTGAAAGCT CCAAAAAAGA AAAAAAGTTT TTTTTTAATT 780 
CCCGTAATTA CTTATTGCAG TATTGTGTTC ATACAAACTG CTCAGTCATT TTGGAGAAAT 840 
AACAATTTTT TTCCTCATCA TG AAGTAAGG TATGCTCACT GCAAAAAAAA TCTAGAAAAT 900 
AAAGAGGAAC ATGCTAAAGA AAAGAATACT CCCATATAAT CTCTGTCTTC ATAAATAATC 960 
TTTTGTAACG CTTATACACT GCTGGTGGGA ATGTAAATTA GTTCAGCCAT TGTGAAAAGT 1020 
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AGCGTAGCAA TTCCTTOAAA AACTTAAAAT AOATTTACCG TTCAACCCAG CAATOCCATT 1080 
ATTGGGCATA TACCCAGTGG AATGTAAATC ATCCTGCCAT AAAAACACAT GCACATGTAT 1140 
GTTCATTGCA GCACTATTCA CAATAGCAAA GACATGGAAT CAACCTATAT GCCCATCAAT 1200 
AGTAGACTGA ATAAAGAAAA TATGGTACAT ATTCACCACA GAATACTAAG CAGCCATAAA 1260 
AAAAAA 



SeqlDNO: 11 DNA sequence 

Nucleic Add Accession #: NMJJ00582.1 

Coding sequence: 88-990 

1 11 21 31 41 51 

I I I I I I 

GCAGAGCACA GCATCGTCGG GACCAGACTC GTCTCAGGCC AGTTGCAGCC TTCTCAGCCA 60 
AACGCCGACC AAGGAAAACT CACTACCATG AGAATTGCAG TGATTTGCTT TTGCCTCCTA 120 
GGCATCACCT GTGCCAT ACC AGTTAAACAG GCTGATTCTG GAAGTTCTG A GGAAAAGC AG 180 
CTTTACAACA AATAOCCAGA TGCTGTGGCC ACATGGCTAA ACCCTGACCC ATCTCAGAAG 240 
CAGAATCTCC TAGCCCC AC A GACCCTTCCA AGTAAGTCCA ACQ AAAGCCA TGACCACATG 300 
GATGATATGG ATGATGAAGA TGATGATG AC CATGTGGACA GCCAGGACTC CATTGACTCG 360 
AACG ACTCTG ATGATGTAGA TGACACTGAT GATTCTCACC AGTCTGATG A GTCTCACCAT 420 
TCTGATGAAT CTGATGAACT GGTCACTGAT TTTCCCACGG ACCTGCCAGC AACCGAAGTT 480 
TTCACTCCAG TTGTCCCCAC AGTAG ACACA T ATGATGGCC GAGGTGATAG TGTGGTTTAT 540 
GGACTGAGGT CAAAATCTAA GAAGTTTCGC AGACCTGACA TCCAGTACCC TGATGCTACA 600 
GACGAGGACA TCAOCTCACA GATGGAAAGC GAGGAGTTGA ATGGTGCATA CAAGGCCATC 660 
CCCGTTGCCC AGGACCTGAA CGCGOCTTCT GATTGGGACA GCCGTGGGAA GGACAGTTAT 720 
GAAACGAGTC AGCTGGATGA CCAGAGTGCT GAAACCCACA GCCACAAGCA GTCCAGATTA 780 
TATAAGCGGA AAGCCAATGA TGAGAGCAAT GAGCATTCCG ATGTGATTGA TAGTCAGGAA 840 
CTTTCCAAAG TCAGCCGTGA ATTCCACAGC CATGAATTTC ACAGCCATGA AGATATGCTG 900 
GTTGTAGACC CCAAAAGTAA GGAAGAAGAT AAACACCTGA AATTTCGTAT TTCTCATGAA 960 
TTAGATAGTG CATCTTCTGA GGTCAATTAA AAGGAGAAAA AATACAATTT CTCACTTTGC 1020 
ATTTAGTCAA AAGAAAAAAT GCTTTATAGC AAAATGAAAG AGAACATGAA ATGCTTCTTT 1080 
CTCAGTTTAT TGGTTGAATG TGT ATCTATT TGAGTCTGGA AATAACTAAT GTGTTTGATA 1140 
ATTAGTTTAG TTTGTGGCTT CATGGAAACT CCCTGTAAAC TAAAAGCTTC AGGGTTATGT 1200 
CTATGTTCAT TCTATAGAAG AAATGCAAAC TATCACTGTA TTTTAATATT TGTTATTCTC 1260 
TCATGAATAG AAATTTATGT AG AAGCA AA C AAAATACTTT TACCCACTTA AAAAGAGAAT 1320 
ATAACATTTT ATGTCACTAT AATCTTTTGT TTTTTAAGTT AGTGTATATT TTGTTGTGAT 1380 
TAT CTTTTT G TGGTGTGAAT AAATCTTTTA TCTTGAATGT AATAAGAATT TGGTGGTGTC 1440 
AATTGCTTAT TTGTTTTCCC ACGGTTGTCC AGCAATTAAT AAAACATAAC CT 1111 1 ACT 1500 
GCCT AAAAAA AAAAAAAAAA AAAA 

Sea ID NO: 12 Protein sequence; 
Protein Accession #: NP.000573.1 

1 11 21 31 41 51 

I I 1 I I I 

MRIAVICFCL LGITCAIPVK QADSGSSEEK QLYNKYPDAV ATWLNPDPSQ KQNLLAPQTL 60 
PSKSNESHDH MDDMDDEDDD DHVDSQDSID SNDSDDVDDT DDSHQSDESH HSDESDELVT 120 
DFPTDLPATE VFTPWPTVD TYDGRGDSW YGLRSKSKKF RRPDIQYPDA TDEDITSHME 1 80 
SEELNGAYKA IPVAQDLNAP SDWDSRGKDS YETSQLDDQS AETHSHKQSR LYKRKANDES 240 
NEHSDVIDSQ ELSKVSREFH SHEFHSHEDM LWDPKSKEB DKHLKFR1SH ELDSASSEVN 



SeqlDNO: 13 DNA sequence 

Nucleic Acid Accession #: NM_001 793 

Coding sequence: 7 1-2560 

1 11 21 31 41 51 

11)111 

AAAGGGGCAA GAGCTGAGCG GAACACCGGC CCGCCGTCGC GGCAGCTGCT TCACCCCTCT 60 
CTCTGCAGCC ATGGGGCTCC CTCGTGGACC TCTCGCGTCT CTCCTCCTTC TCCAGGTTTG 120 
CTGGCTGCAG TGCGCGGCCT CCGAGCCGTG CCGGGCGGTC TTCAGGGAGG CTGAAGTGAC 180 
CTTGGAGGCG GGAGGCGCGG AGCAGGAGCC CGGCCAGGCO CTGGGGAAAG TATTCATGGG 240 
CTGCCCTGGG CAAGAGCCAG CTCTGTTTAG CACTGATAAT GATGACTTCA CTGTGCGGAA 300 
TGGCGAGACA GTCCAGGAAA GAAGGTCACT GAAGGAAAGG AATCCATTGA AGATCTTCCC 360 
ATCCAAACGT ATCTTACGAA GACACAAGAG AGATTGGGTG GTTGCTCCAA TATCTGTCCC 420 
TGAAAATGGC AAGGGTCCCT TCCCCCAGAG ACTG AATCAG CTCAAGTCTA ATAAAGATAG 480 
AGACACCAAG ATTTTCTACA GCATCACGGG GOCGGGGGCA GACAGCCCCC CTGAGGGTGT 540 
CTTCGCTGTA GAGAAGGAGA CAGGCTGGTT GTTGTTGAAT AAGCCACTGG ACCGGGAGGA 600 
GATTGCCAAG TATGAGCTCT TTGGCCACGC TGTGTCAGAG AATGGTGCCT CAGTGGAGGA 660 
CCCCATGAAC ATCTCCATCATCGTGACCGA CCAGAATGAC CACAAGCCCA AGTTTACCCA 720 
GGACACCTTC CGAGGGAGTG TCTTAGAGGG AGTCCT ACCA GGTACTTCTG TGATGCAGGT 780 
GACAGCCACG GATGAGGATG ATGCCATCTA CACCTACAAT GGGGTGGTTG CTTACTCCAT 840 
CCATAGCCAA GAACCAAAGG ACCCACACGA CCTCATGTTC ACCATTCACC GGAGCACAGG 900 
CACCATCAGC GTCA7XTCCA GTGGCCTGG A CCGGG AAAAA GTCCCTGAGT ACACACTG AC 960 
CATCCAGGCC ACAGACATGG ATGGGGACGG CTCCACCACC ACGGCAGTGG CAGTAGTGGA 1020 
GATCCTTGAT GCCAATGACA ATGCTCCCAT GTTTGACCCC CAGAAGTACG AGGCCCATGT 1080 
GCCTGAGAAT GCAGTGGGCC ATG AGGTGCA GAGGCTGACG GTCACTGATC TGGACGCCCC 1 140 
CAACTCACCA GCGTGGCGTG CCACCTACCT TATCATGGGC GGTGACGACG GGGACCATTT 1200 
TACCATCACC ACCCACCCTG AGAGCAACCA GGGCATCCTG ACAACCAGGA AGGGTTTGGA 1260 
TTTTGAGGCC AAAAACCAGC ACACCCTGTA CGTTGAAGTG ACCAACGAGG CCCCTTTTGT 1320 
GCTGAAGCTC CCAACCTCCA CAGCCACCAT AGTGGTCCAC GTGGAGGATG TQAATGAGGC 1380 
ACCTGTGTTT GTCCCACCCT CCAAAGTCGT TGAGGTCCAG GAGGGCATCC CCACTGGGG A 1440 
GCCTGTGTGT GTCTACACTG CAGAAGACCC TGACAAGGAG AATCAAAAGA TCAGCTACCG 1500 
CATCCTGAGA GACCCAGCAG GGTGGCTAGC CATGGACCCA GACAGTGGGC AGGTCACAGC 1560 
TGTGGGCACC CTCGACCGTG AGGATGAGCA GTTTGTGAGG AACAACATCT ATGAAGTCAT 1620 
GGTCTTGGCC ATGGACAATG GAAGCCCTCC CACCACTGGC ACGGGAACCC TTCTGCTAAC 1680 
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ACTGATTGAT GTCAATGACC ATGGCCCAGT CCCTGAGCCC CGTCAGATCA CCATCTGCAA 1740 
CCAAAGCCCT GTGCGCCAGG TGCTGAACAT CACGGACAAG G ACCTGTCTC CCCACACCTC 1 800 
CCCTTTCCAG GCCCAGCTCA CAGATGACTC AG ACATCTAC TGGACGGCAG AGGTCAACGA 1860 
GGAAGGTGAC ACAGTGGTCT TGTCCCTGAA GAAGTTOCTG AAGCAGG ATA CATATGACGT 1920 
GCACCTTTCT CTGTCTGACC ATGGCAACAA AGAGCAGCTG ACGGTGATCA GGGCCACTGT 1980 
GTGCGACTGC CATGGCCATG TCGAAACCTG CCCTGGACCC TGGAAGGGAG GTTTCATCCT 2040 
CCCTGTGCTG GGGGCTGTCC TGGCTCTGCT GTTCCTCCTG CTGGTGCTGC TTTTGTTGGT 2100 
GAGAAAGAAG OGGAAGATCA AGGAGCCCCT CCTACTCCCA GAAGATGACA CCCGTGACAA 2160 
CGTCTTCTAC TATGGCGAAG AGGGGGGTGO CGAAGAGGAC CAGGACTATG ACATCACCCA 2220 
GCTCCACCGA GGTCTGGAGG CCAGGCCGGA GGTGGTTCTC CGCAATGACG TGGCACCAAC 2280 
CATCATCCCG ACACCCATGT ACCGTCCTCG GCCAGCCAAC CCAGATG AAA TCGGCAACTT 2340 
TATAATTGAG AACCTGAAGG CGGCTAACAC AGACCCCACA GOCCCGCCCT ACGACACCCT 2400 
CTTGGTGTTC GACTATGAGG GCAGCGGCTC CGAOGCCGCG TCCCTGAGCT CCCTCACCTC 2460 
CTCCGCCTCC GACCAAGACC AAGATTACGA TTATCTGAAC GAGTGGGGCA GCCGCTTCAA 2520 
GAAGCTGGCA GACATGTACG GTGGCGGGGA GGACGACTAG GCGGCCTGCC TGCAGGGCTG 2580 
GGG ACCAAAC GTCAGGCCAC AG AGCATCTC CAAGGGGTCT CAGTTCCCXX TTCAGCTGAG 2640 
GACTTCGGAG CTTGTCAGGA AGTGGCCGTA GCAACTTGGC GGAGACAGGC TATGAGTCTG 2700 
ACGTTAGAGT GGTTGCTTCC TTAGCCTTTC AGG ATGGAGG AATGTGGGC A GTTTGACTTC 2760 
AGCACTGAAA ACCTCTCCAC CTGGGCCAGG GTTGCCTCAG AGGCCAAGTT TCCAG AAGCC 2820 
TCTTACCTGC CGTAAAATGC TCAACCCTGT GTCCTGGGCC TGGGCCTGCT GTGACTGACC 2880 
TACAGTGGAC TTTCTCTCTG GAATGGAACC TTCTTAGGCC TCCTGGTGCA ACTTAATTTT 2940 
TTTTTTTAAT GCTATCTTCA AAACGTTAGA GAAAGTTCTT CAAAAGTGCA GCCCAGAGCT 3000 
GCTGGGCCCA CTGGCCGTCC TGCATTTCTG GTTTCCAGAC CCCAATGCCT CCCATTCGGA 3060 
TGGATCTCTG CGTTTTTATA CTGAGTGTGC CTAGGTTGCC CCTTATTTTT TATTTTCOCT 3120 
GTTGCGTTGC TATAGATGAA GGGTGAGGAC AATCGTGTAT ATGTACTAGA ACTTTTTTAT 3180 
TAAAGAAACT TTTCCCAG AA AAAAA 

ScqIDNO: 14 Protein sequence: 
Protein Accession ft: NP_00 1784.2 



1 11 21 31 41 51 

I I I I I I 

MGLPRGPLAS LLLLQVCWLQ CAASEPCRAV FREABVTLEA GGAEQEPGQA LGKVFMGCPG 60 
QEPALFSTDN DDFTVRNGET VQERRSLKER NPLKJFPSKR ILKRHKRDWV VAPISVPENG 120 
KGPFPQRLNQ LKSNKDRDTK IFYS1TGPGA DSPPEGVFAV EKETGWLLLN KPLDREEIAK 180 
YELFGHAVSE NGASVEDPMN ISHVTDQND HKPKFTQDTF RGSVLEGVLP GTSVMQVTAT 240 
DEDDAIYTYN GWAYSIHSQ EPKDPHDLMF TWRSTGTIS VISSGLDREK VPEYTLTIQA 300 
TDMDGDGSTT TAVAWEILD ANDNAPMFDP QKYEAHVPEN AVGHEVQRLT VTDLDAPNSP 360 
AWRATYUMG GDDGDHFTIT THPESNQGIL TTRKGLDFEA KNQHTLYVEV TNEAPFVLKL 420 
PTSTATIWH VEDVNEAPVF VPPSKWEVQ EG1PTGEPVC VYTAEDPDKE NQKISYRHR 480 
DPAGWLAMDP DSGQVTAVGT LDREDEQFVR NNIYEVMVLA MDNGSPPTTG TGTLLLTUD 540 
VNDHGPVPEP RQITICNQSP VRQVLNITDK DLSPHTSPFQ AQLTDDSDIY WTAEVNEEGD 600 
TWLSLKKFL KQDTYDVHLS LSDHGNKEQL TVKATVCDC HGHVETCPGP WKGGFILPVL 660 
GAVLALLFLL LVLLLLVRKK RKHCEPLLLP EDDTRDNVFY YGEEGGGEED QDYDITQLHR 720 
GLEARPE WL RNDVAPTIIP TPMYRPRPAN PDEIGNFIIE NLKAANTDPT APPYDTLLVF 780 
DYEGSGSDAA SLSSLTSSAS DQDQDYDYLN EWGSRFKKLA DMYGGGEDD 

Sea TP NO: 15 DNA sequence 

Nucleic Acid Accession V: XM_05 1 860.2 

Cooing sequence: 261-4346 



1 11 21 31 41 51 
1)1111 

GAGCTAGCGC TCAAGCAGAG CCCAGCGCGG TGCTATCGGA CAGAGCCTGG CGAGCGCAAG 60 
CGGCGCGGGG AGCCAGCGGG GCTGAGCGCG GCCAGGGTCT GAACCCAGAT TTCCCAGACT 120 
AGCTACCACT CCGCTTGCOC ACGCCCCGGG AGCTCGCGGC GCCTGGCGGT CAGCG ACCAG 1 80 
ACGTCCGGGG CCGCTGCGCT CCTGGCCCGC GAGGCGTGAC ACTGTCTCGG CTACAGACCC 240 
AGAGGGAGCA CACTGCCAGG ATGGGAGCTG CTGGGAGGCA GGACTTCCTC TTCAAGGCCA 300 
TGCTGACCAT CAGCTGGCTC ACTCTGACCT GCTTCCCTGG GGCCACATCC ACAGTGGCTG 360 
CTGGGTGCCC TGACCAGAGC CCTGAGTTGC AACCCTGGAA CCCTGGCCAT GACCAAGACC 420 
ACCATGTGCA TATCGGCCAG GGCAAGACAC TGCTGCTCAC CTCTTCTGCC ACGGTCTATT 480 
CCATCCACAT CTCAG AGGG A GGCAAGCTGG TCATTAAAGA CCACGACGAG CCGATTGTTT 540 
TGCGAACCCG GCACATCCTG ATTGACAACG GAGGAGAGCT GCATGCTGGG AGTGOCCTCT 600 
GCCCTTTCCA GGGCAATTTC AOCATCATTT TGTATGGAAG GGCTGATGAA GGTATTCAGC 660 
CGGATOCTTA CTATGGTCTG AAGTACATTG GGGTTGGTAA AGGAGGGGCT CTTGAGTTGC 720 
ATGGACAGAA AAAGCTCTCC TGGACATTTC TGAACAAGAC CCTTCACCCA GGTGGCATGG 780 
CAGAAGGAGO CTATTTTTTT GAAAGGAGCT GGGGCCACCG TGGAGTTATT GTTCATGTCA 840 
TCGACCCCAA ATCAGGCACA GTCATCCATT CTGACCGGTT TGACACCTAT AGATCCAAGA 900 
AAGAGAGTGA ACGTCTGGTC CAGTATTTGA ACGCGGTGCC CGATGGCAGG ATCCTTTCTG 960 
TTGCAGTGAA TGATGAAGGT TCTCGAAATC TGGATGACAT GGCCAGGAAG GCGATGACCA 1020 
AATTGGGAAG CAAACACTTC CTGCACCTTG GATTTAGACA CCCTTGGAGT TTTCTAACTG 1080 
TGAAAGGAAA TCCATCATCT TCAGTGGAAG ACCATATTGA ATATCATGGA CATCGAGGCT 1140 
CTGCTGCTGC CCGGGTATTC AAATTGTTCC AGACAGAGCA TGGCGAATAT TTCAATGTTT 1200 
CTTTGTCCAG TGAOTGGGTT CAAGACGTGG AGTGGACGGA GTGGTTCGAT CATGATAAAG 1260 
TATCTCAGAC TAAAGGTGGG GAGAAAATTT CAGACCTCTG GAAAGCTCAC CCAGGAAAAA 1320 
TATGCAATCG TCCCATTGAT ATACAGGCCA CTACAATGGA TGGAGTTAAC CTCAGCACCG 1380 
AGGTTGTCTA CAAAAAAGGC CAGGATTATA GGTTTGCTTG CTACGACCGG GGCAGAGCCT 1440 
GCCGGAGCTA CCGTGTACGG TTCCTCTGTG GGAAGCCTGT GAGGCCCAAA CTCACAGTCA 1500 
CCATTGACAC CAATGTGAAC AGCACCATTC TGAACTTGGA GGATAATGTA CAGTCATGGA 1560 
AACCTGGAGA TACCCTGGTC ATTGCCAGTA CTGATTACTC CATGTACCAG GCAGAAGAGT 1620 
TCCAGGTGCT TCCCTGCAGA TCCTGCGCCC CCAACCAGGT CAAAGTGGCA GGGAAACCAA 1680 
TGTACCTGCA CATCGGGGAG GAGATAGACG GCGTGGACAT GCGGGCGG AG GTTGGGCTTC 1 740 
TGAGCCGGAA CATCATAGTG ATGGGGGAGA TGGAGGACAA ATGCTACCCC TACAGAAACC 1800 
ACATCTGCAA TTTCTTTGAC TTCGATACCT TTGGGGGCCA CATCAAGTTT GCTCTGGGAT 1860 
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TTAAGGCAGC ACACTTGGAG GGCACGGAGC TGAAGCATAT GGGACAGCAG CTGGTGGGTC 1920 
AGTACCCGAT TCACTTCCAC CTGGCOGGTG ATGTAGACGA AAGGGGAGGT TATGACCCAC 1980 
CCACATACAT CAGGG ACCTC TCCATCCATC ATACATTCTC TCGCTGCGTC ACAGTOCATG 2040 
GCTCCAATGG CTTGTTGATC AAGGACGTTG TGGGCTATAA CTCTTTGGGC CACTGCTTCT 2100 
TCACGGAAGA TGGGCCGGAG GAACGCAACA CTTTTGACCA CTGTCTTGGC CTCCTTGTCA 2160 
AGTCTGGAAC CCTCCTCCCC TCGGACCGTG ACAGCAAGAT GTGCAAGATG ATCACAGAGG 2220 
ACTCCTACCC GGGGTACATC CCCAAGCCCA GGCAAGACTG CAATGCTGTG TCCACCTTCT 2280 
GGATGGCCAA TCOCAACAAC AACCTCATCA ACTGTGCCGC TGCAGGATCT GAGGAAACTG 2340 
GATTTTGGTT TATTTTTCAC CACGTACCAA CGGGCCCCTC CGTGGGAATG TACTCCCCAG 2400 
GTTATTCAGA GCACATTOCA CTGGGAAAAT TCTATAACAA CCGAGCACAT TCCAACTACC 2460 
GGGCTGGCAT GATCATAGAC AACGGAGTCA AAACCACCGA GGCCTCTGCC AAGG ACAAGC 2520 
GGCCGTTCCT CTCAATCATC TCTGCCAGAT ACAGCCCTCA CCAGGACGCC GACCCGCTGA 2580 
AGCCCCGGGA GCCGGCCATC ATCAGACACT TCATTGCCTA CAAGAACCAG GACCACGGGG 2640 
CCTGGCTGCG CGGCGGGGAT GTGTGGCTGG ACAGCTGCCG GTTTGCTGAC AATGGCATTG 2700 
GCCTGACCCT GGCCAGTGGT GGAAOCTTCC CGTATGACGA CGGCTCCAAG CAAGAGATAA 2760 
AGAACAGCTT GTTTGTTGGC GAGAGTGGCA ACGTGGGGAC GGAAATGATG GACAATAGGA 2820 
TCTGGGGCCC TGGCGGCTTG GAOCATAGCG GAAGG ACOCT CCCTATAGGC CAGAATTTTC 2880 
CAATTAGAGG AATTCAGTTA TATGATGGCC CCATCAACAT CCAAAACTGC ACTTTCCGAA 2940 
AGTTTGTGGC CCTGGAGGGC CGGCACACCA GOGCCCTGGC CTTCOGCCTG AATAATGCCT 3000 
GGCAGAGCTG CCCCCATAAC AACGTGACCG GCATTGCCTT TGAGGACGTT CCGATTACTT 3060 
CCAGAGTGTT CTTCGGAGAG CCTGGGCCCT GGTTCAACCA GCTGGACATG GATGGGGATA 3120 
AGACATCTGT GTTCCATGAC GTCGACGGCT OOGTGTCCGA GTAOCCTGGC TCCTACCTCA 3180 
CGAAGAATGA CAACTGGCTG GTCCGGCACC CAGACTGCAT CAATGTTCCC GACTGGAGAG 3240 
GGGCCATTTG CAGTGGGTGC TATGCACAGA TGTACATTCA AGCCTACAAG ACCAGTAACC 3300 
TGOGAATGAA GATCATCAAG AATGACTTCC OCAGCCACCC TCTTTACCTG GAGGGGGCGC 3360 
TCACCAGGAG CACCCATTAC CAGCAATACC AACCGGTTGT CACCCTGCAG AAGGGCTACA 3420 
CCATCCACTG GGACCAGACG GCCCCCGCCG AACTCGCCAT CTGGCTCATC AACTTCAACA 3480 
AGGGCGACTG GATCCGAGTG GGGCTCTGCT ACCCGCGAGG CACCACATTC TCCATCCTCT 3540 
CGGATGTTCA CAATCGCCTG CTGAAGCAAA CGTCCAAGAC GGGCX3TCTTC GTGAGGACCT 3600 
TGCAGATGGA CAAAGTGGAG CAGAGCTACC CTGGCAGGAG CCACTACTAC TGGGACGAGG 3660 
ACTCAGGGCT GTTGTTCCTG AAGCTGAAAG CTCAGAACGA GAGAGAGAAG TTTGCTTTCT 3720 
GCTCCATGAA AGGCTGTGAG AGGATAAAGA TTAAAGCTCT GATTCCAAAG AACGCAGGCG 3780 
TCAGTGACTG CACAGCCACA GCTTACCOCA AGTTCACOGA GAGGGCTGTC GTAGACGTGC 3840 
CGATGCOCAA GAAGCTCTTT GGTTCTCAGC TGAAAACAAA GGACCATTTC TTGGAGGTGA 3900 
AGATGG AGAG TTCCAAGCAG CA CTTCTT CC ACCTCTGGAA CGACTTCGCT TACATTGAAG 3960 
TGGATGGGAA GAAGTACCOC AGTTCGGAGG ATGGCATCCA GGTGGTGGTG ATTGACGGG A 4020 
ACCAAGGGCG CGTGGTGAGC CACACGAGCT TCAGGAACTC CATTCTGCAA GGCATACCAT 4080 
GGCAGCTTTT CAACTATGTG GCGACCATCC CTGACAATTC CATAGTGCTT ATGGCATCAA 4140 
AGGGAAGATA CGTCTCCAGA GGCCCATGGA CCAGAGTGCT GGAAAAGCTT GGGGCAGACA 4200 
GGGGTCTCAA GTTGAAAGAG CAAATGGCAT TCGTTGGCTT CAAAGGCAGC TTCCGGCCCA 4260 
TCTGGGTGAC ACTGGACACT GAGGATCACA AAGCCAAAAT CTTCCAAGTT GTGCCCATCC 4320 
CTGTGGTGAA GAAGAAGAAG TTGTGAGGAC AGCTGCCGCC CGGTGCCACC TCGTGGTAGA 4380 
CTATGACGGT GACTCTTGGC AGCAGACCAG TGGGGGATGG CTGGGTCOCC CAGCCOCTGC 4440 
CAGCAGCTGC CTGGGAAGGC OGTGTTTCAG COCTGATGGG CCAAGGGAAG GCTATCAGAG 4500 
ACCCTGGTGC TGCCACCTGC CCCTACTCAA GTGTCTACCT GGAGCCCCTG GGGCGGTGCT 4560 
GGCCAATGCT GGAAACATTC ACTTTCCTGC AGCCTCTTGG GTGCTTCTCT CCTATCTGTG 4620 
CCTCTTCAGT GGGGGTTTGG GGACCATATC AGGAGACCTG GGTTGTGCTG ACAGCAAAGA 4680 
TCCACTTTGG CAGGAGCCCT GACCCAGCTA GGAGGTAGTC TGGAGGGCTG GTCATTCACA 4740 
GATCCCCATG GTCTTCAGCA GACAAGTGAG GGTGGTAAAT GTAGGAGAAA GAGCCTTGGC 4800 
CTT AAGG AAA TCTTTACTCC TGTAAGCAAG AGCCAACCTC ACAGGATTAG GAGCTGGGGT 4860 
AGAACTGGCT ATCCTTGGGG AAGAGGCAAG OCCTGCCTCT GGCCGTGTCC ACCTTTCAGG 4920 
AGACTTTGAG TGGCAGGTTT GGACTTGGAC TAGATGACTC TCAAAGGCCC TTTTAGTTCT 4980 
GAGATTCCAG AAATCTGCTG CATTTCACAT GGTACCTGGA ACCCAACAGT TCATGG ATAT 5040 
CCACTGATAT CCATGATGCT GGGTGCCCCA GCGCACACGG GATGGAGAGG TGAGAACTAA 5100 
TGCCTAGCTT GAGGGGTCTG CAGTCCAGTA GGGCAGGCAG TCAGGTOCAT GTGCACTGCA 5160 
ATGCCAGGTG GAGAAATCAC AGAGAGGTAA AATGGAGGCC AGTGCCATTT CAGAGGGGAG 5220 
GCTCAGGAAG GCTTCTTGCT TACAGGAATG AAGGCTGGGG GCATTTTGCT GGGGGGAGAT 5280 
GAGGCAGCCT CTGG AATGGC TCAGGGATTC AGCCCTCCCT GCOGCTGCCT GCTGAAGCTG 5340 
GTGACTACGG GGTCGCCCTT TGCTCACGTC TCTCTGGCCC ACTCATGATG GAGAAGTGTG 5400 
GTCAG AGGGG AGCAATGGGC TTTGCTGCTT ATG AGCACAG AGGAATTCAG TCCCCAGGCA 5460 
GCOCTGOCTC TGACTCCAAG AGGGTGAAGT CCAGAGAAGT GAGCTCCTGC CTTAGGGCCT 5520 
CATTTGCTCT TCATCCAGGG AACTGAGCAC AGGGGGCCTC CAGGAGACCC TAGATGTGCT 5580 
CGTACTCCCT CGGCCTGGGA TTTCAGAGCT GGAAATATAG AAAATATCTA GCCCAAAGCC 5640 
TTGATTTTAA CAGATGGGGA AAGTGAGCCC CCAAGATGGG AAAGAACCAC ACAGCTAAGG 5700 
GAGGGOCTGG GGAGCCCCAC OCTAGCCCTT GCTGCCACAC CACATTGOCT CAACAACOGG 5760 
CCCCAGAGTG CCCAGGCACT CCTGAGGTAG CTTCTGGAAA TGGGGACAAG TCCCCTCGAA 5820 
GGAAAGGAAA TGACTAGAGT AGAATGACAG CTAGCAGATC TCTTCCCTCC TGCTCCCAGC 5880 
GCACACAAAC CCGCCCTCCC CTTGGTGTTG GCGGTCCCTG TGGCCTTCAC TTTGTTCACT 5940 
ACCTGTCAGC CCAGCCTGGG TGCACAGTAG CTGCAACTCC CCATTGGTGC TACCTGGCTC 6000 
TCCTGTCTCT GCAGCTCTAC AGGTGAGGCC CAGCAGAGGG AGTAGGGCTC GCCATGTTTC 6060 
TGGTGAGCCA ATTTGGCTGA TCTTGGGTGT CTGAACAGCT ATTGGGTCCA CCCCAGTCCC 6120 
TTTCAGCTGC TGCTTAATGC CCTGCTCTCT CCCTGGCCCA GCTTATAGAG AGCCCAAAGA 6180 
GCTCCTGTAA GAGGGAGAAC TCTATCTGTG GTTTATAATC TTGCACGAGG CACCAGAGTC 6240 
TCCCTGGGTC TTGTGATGAA CTACATTTAT CCCCTTTCCT GCCCCAACCA CAAACTCTTT 6300 
CCTTCAAAGA GGGCCTGCCT GGCTCCCTCC ACCCAACTGC ACCCATGAGA CTCX3GTCCAA 6360 
GAGTCCATTC CCCAGGTGGG AGCCAACTGT CAGGG AG GTC TTTCCCAOCA AACATCTTTC 6420 
AGCTGCTGGG AGGTGACCAT AGGGCTCTGC TTTTAAAGAT ATGGCTGCTT CAAAGGCCAG 6480 
AGTCACAGG A AGGACTTCTT CCAGGGAG AT TAGTGGTG AT GG AGAGGAG A GTTAAAATG A 6540 
CCTCATGTCC TTCTTGTCCA CXiGTTTTGTT G AGTTTTCAC TCTTCTAATG CAAGGGTCTC 6600 
ACACTGTGAA CCACTTAGGA TGTGATCACT TTCAGGTGGC CAGGAATGTT GAATGTCTTT 6660 
GGCTCAGTTC ATTTAAAAAA GATATCTATT TGAAAGTTCT CAGAGTTGTA CATATGTTTC 6720 
ACAGTACAGG ATCTGTACAT AAAAGTTTCT TTCCTAAACC ATTCACCAAG AGCCAATATC 6780 
TAGGCATTTT CTTGGTAGCA CAAATTTTCT TATTGCTTAG AAAATTGTCC TCCTTGTTAT 6840 
TTCTGTTTGT AAGACTTAAG TGAGTTAGGT CT TTAAGG AA AGCAACGCTC CTCTGAAATG 6900 
CTTGTCTTTT TTCTGTTGCC GAAATAGCTG GTCCTTTTTC GGGAGTTAGA TGTATAGAGT 6960 
QTTTGTATGT AAACATTTCT TGTAGGCATC ACCATGAACA AAGATATATT TTCTATTTAT 7020 
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TTATTATATG TOCACTTCAA GAAQTCACTG TCAGAGAAAT AAAGAATTGT CTTAAATGTC 



Seq ID NO: 16 Protein seouencc: 
* Protein Accession ft XP 051 8602 

1 U 21 31 41 SI 

I ! I I I I 

MGAAGRQDFL FKAMLTISWL TLTCFPGATS TVAAGCPDQS PELQPWNPGH DQDHHVHIGQ 60 
GKTLLLTSSA TVYSIHISEG GKLVDCDHDE PTVLRTRHIL IDNGGELHAG SALCPFQGNF 120 
THLYGRADE GIQPDPYYGL KYIGVGKGGA LELHGQKKLS WTFLNKTLHP GGMAEGGYFF 1 80 
ERSWGHRGVI VHVIDPKSGT VIHSDRFDTY RSKKESERLV QYLNAVPDGR ILSVAVNDEG 240 
SRNLDDMARK AMTKLGSKHF LHLGFRHPWS FLTVKGNPSS SVEDHIEYHG HRGSAAARVF 300 
KLFQTEHGEY FNVSLSSEWV QDVEWTEWFD HDKVSQTKGG EKISDLWKAH PGKICNRPID 360 
IQATTMDGVN LSTEWYKKG QDYRFACYDR GRACRSYRVR FLCGKPVRPK LTVTIDTNVN 420 
STILNLEDNV QSWKPGDTLV IASTDYSMYQ AEEFQVLPCR SCAPNQVKVA GKPMYLHGE 480 
EIDGVDMRAE VGLLSRNIIV MGEMEDKCYP YRNHICNFFD FDTFGGHKF ALGFKAAHLE 540 
GTBLKHMGQQ LVGQYPIHFH LAGDVDERGG YDPPTYIRDL SIHHTFSRCV TVHGSNGLL1 600 
KD WGYNSLG HCFFTEDGPE ERNTFDHCLG LLVKSGTLLP SDRDSKMCKM ITEDSYPGYT 660 
PKPRQDCNAV STFWM ANPNN NLINCAAAGS EETGPWFIFH HVPTGPSVGM YSPGYSEHIP 720 
LGKFYNNRAH SNYRAGMIED NGVKTTEASA KDKRPFLSU SARYSPHQDA DPLKPREPAI 780 
IRHFIAYKNQ DHGAWLRGGD VWLDSCRFAD NG1GLTLASG GTFPYDDGSK QEDCNSLFVG 840 
ESGNVGTEMM DNRIWGPGGL DHSGRTLPIG QNFPIRGIQL YDGPINIQNC TFRXFVALEG 900 
RHTS ALAFRL NNAWQSCPHN NVTGIAFEDV PITSRVFFGE PGPWFNQLDM DGDKTSVFHD 960 
VDGSVSEYPG SYLTKNDNWL VRHPDONVP DWRGAICSGC YAOMYIQAYK TSNLRMKIDC 1020 
NDFPSHPLYL EGALTRSTHY QQYQPWTLQ KGYTIHWDQT APAELAIWLI NFNKGDWIRV 1080 
GLCYPRGTTF SILSDVHNRL UCQTSKTGVF VRTLQMDKVE QSYPGRSHYY WDEDSGLLFL 1 140 
KLKAQNEREK FAFCSMKGCE RIKIKALIPK NAG VSDCTAT AYPKFTERAV VDVPMPKKLF 1200 
GSQLKTKDHF LEVKMESSKQ HFFHLWNDFA YEVDGKKYP SSEDGIQVW IDGNQGRWS 1260 
HTSFRNSILQ GIPWQLFNYV ATIPDNSIVL MASKGRYVSR GPWTRVLEKL GADRGLKLKE 1320 
QMAFVGFKGS FRPIWVTLDT EDHKAKIFQV VPIPWKKKK L 

Seq ID NO: 17 pNA, sequence 

Nucleic Acid Accession #: NM015515.1 

Coding sequence: 61-1329 

1 11 21 31 41 SI 
I I I I I I 

AGTTCTGCGG TGCCAGGGAG TGGAGCAGAG CTCAGCCCCG TOCCAAACAC AGATGGGACC 60 
ATGAACTCCG GACACAGCTT CAGCCAGACC CCCTCGGCCT CCTTCCATGG CGCCGGAGGT 120 
GGCTGGGGCC GGCCCAGGAG CTTCCCCAGG GCTCCCACCG TCCATGGCGG TGCGGGGGGA 180 
GCCCGCATCT CCCTGTCCTT CACCACGCGG AGCTGCCCAC CCCCTGGAGG GTCTTGGGGT 240 
TCTGGAAGAA GCAGCCCCCT ACTAGGCGGA AATGGGAAGG CCACCATGCA GAATCTCAAC 300 
GACCGCCTGG CCTCCTACCT GGAGAAGGTT CGCGCCCTGG AGGAGGCCAA CATGAAGCTG 360 
GAAAGCCGCA TCCTGAAATG GCACCAGCAG AGAGATCCTG GCAGTAAGAA AGATTATTCC 420 
CAGTATGAGG AAAACATCAC ACACCTGCAG GAGCAGATAG TGGATGGTAA GATGACCAAT 480 
GCTCAGATTA TTCTTCTCAT TGACAATGCC AGGATGGCAG TGGATGACTT CAACCTCAAG 540 
TATGAAAATG AACACTCCTr TAAGAAAGAC TTGGAAATTG AAGTCGAGGG CCTCCGAAGG 600 
ACCTTAGACA ACCTGACCAT TGTCACAACA GACCTAGAAC AGGAGGTGGA AGGAATGAGG 660 
AAAGAGCTCA TTCTCATGAA GGAGCACCAT GAGCAGGAAA TGGAGGAGCA TCATGTGCCA 720 
AGTGACTTCA ATGTCAATGT GAAGGTGGAT ACAGGTCCCA GGGAAGATCT GATTAAGGTC 780 
CTGGAGGATA TGAGACAAGA ATATGAGCTT ATAATAAAGA AGAAGCATCG AGACTTGGAC 840 
ACTTGGTATA AAGAACAGTC TGCAGCCATG TCCCAGGAGG CAGCCAGTCC AGCCACTGTG 900 
CAGAGCAGAC AAGGTGACAT CCACX}AACTG AAGCGCACAT TCCAGGCCCT GGAGATTGAC 960 
CTGCAGGCAC AGTACAGCAC GAAATCTGCT TTGGAAAACA TGTTATCCGA GACCCAGTCT 1020 
CGGTACTCCT GCAAGCTCCA GGACATGCAA GAGATCATCT CCCACTATGA GGAGGAACTG 1080 
ACGCAGCTAC GCCACGAACT GGAGCGGCAG AACAATGAAT ACCAAGTGCT GCTGGGCATC 1140 
AAAACCCACC TGGAGAAGGA AATCACCACG TACCGACXJGC TCCTGGAGGG AGAGAGTGAA 1200 
GGGACACGGG AAGAATCAAA GTCGAGCATG AAAGTGTCTG CAACTCCAAA GATCAAGGCC 1260 
ATAACCCAGG AGACCATCAA CGGAAGATTA GTTCTTTGTC AAGTGAATGA AATCCAAAAG 1320 
CACGCATGAG ACCAATG AAA GTTTCCGCCT GTTGTAAAGT CTATTTTCOC CCAAGGAAAG 1380 
TCCTTGCACA GACACCAGTG AGTGAGTTCT AAAAGATACC CTTGGAATTA TCAG ACTCAG 1440 
AAACTTTTAT U11U111I CTGTAACAGT CTCAOCAGAC TTCTCATAAT GCTCTTAATA 1500 
TATTGCACTT TTCTAATCAA AGTGCGAGTT TATGAGGGTA AAGCTCTACT TTCCTACTGC 1560 
AGCCTTCAG A TTCTCATCAT TTTGCATCTA TTTTGTAGCC AATAAAACTC CGCACTAGC 



Seq ID NO: 18 Protein sequence: 
Protein Accession ft NP_056330.1 

1 11 21 31 41 51 

1 1 I 1 1 1 

MNSGHSFSQT PSASFHGAGG GWGRPRSFPR APTVHGGAGG ARISLSFTTR SCPPPGGSWG 60 
SGRSSPLLGG NGKATMQNLN DRLASYLEKV RALEEANMKL ESRUKWHQQ RDPGSKKDYS 120 
QYEENITHLQ EQIVDGKMTN AQHLLIDNA RMAVDDFNLK YENEHSFKKD LEIEVEGLRR 180 
TLDNLTIVTT DLEQEVEGMR KELILMKEHH EQEMEEHHVP SDFNVNVKVD TGPREDUKV 240 
LEDMRQEYEL IDCKKHRDLD TWYKEQSAAM SQEAASPATV QSRQGDIHEL KRTFQALEID 300 
LQAQYSTKSA LENMLSETQS RYSCKLQDMQ EUSHYEEEL TQLRHELERQ NNEYQVLLGI 360 
KTHLEKEITT YRRLLEGESE GTREESKSSM KVSATPKIKA ITQBTINGRL VLCQVNEIQK 420 
HA 

Seq ID NO: 19 PNA sequence 

Nucleic Acid Accession #: see Table 25 & 25 A for complete list 

1 11 21 31 41 51 

I I I I I I 
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TTTTTTTTTT TTAAAAAAAA GAGGCTTGGT AAGTTTTTOA TACTTAGTTG ACTTTTAGCA 60 
TTATCCAGCA TTTGTATTAT G AACCAGTGA GTACTGTAAT TTTTCTTTCC CTTTCAGAAA 120 
GACTCAAAGG G AACATATAA ATGTTTCCTA TTTTTNNNNN NNNNNNNNNN NNNNNNNNNN 180 
NNNNACCCAT CGTGCGATGA TCNNNNNNNN NNNNNNNNNN NNNNNTTGGG ATCCAGTTTC 240 
AAATAAGGTA TGGGAAAAAC AGATGTTTTC ATTATCGCCA CTTAATCCTT ACTTCCGATT 300 
ATAATTATAC ATGTTTGGCT GTAATAACT A TACTAAAGCA TGCTTGTGAA AGTAGACTTC 360 
TACAAGGACA GAAAACCCAC AACAACAAAG ATCGATCACG AAAGACAAGG CATATTCATT 420 
CATTAATTTA CTTCTCTTAG ACCCGGGACA TGTGGG ACAA ATACTTTTGT CCTCATGGAT 480 
GGCTTGATAA TTTATTTATA TGTTCTAGAG TCTGAGGATT TTCTTTCAGT GGCAG ACAAC 540 
AAAGGATGTT ACAATTTACT TCAAAATAAT ACAATCATGG TTTAATTTAC AGTGTAAATC 600 
CATAACTATT TTATAGAG AT GGATTATCAT ACATGGGATT ATAAAAATAA CTT ACCCATA 660 
TGCTTGCAAA ATAGACTTTT CCTATTGGGA GGAACATCTT TTAACCTAAA ACGGATTTAT 720 
TTCAGATGAA TTAGACAGTA CATTTTTCAG GAGAACCAGC CTTACTGGAT GATCTTTTGT 780 
CAGGTTTGGA GGCCTCTTCT TTGTCTTTGC AACCATAACC CCTTTTCAGC TGAAGACCAC 840 
TGGCCTTCAA CCCAAGCCAG GAGTTTGGCT CAAATGA 

SeqroNfr 20 DN^ sequence 

Nucleic Acid Accession #: D3205 1 .1 

Coding sequence: 72-1373 



1 11 21 31 41 51 

I I I I I I 

GAATTCGAAC CAGGTGGCCA COCGGTGTCG GTTTCATTTT CCTTTGGAAT TTCTGCTTTA 60 
CAGACAGAAC AATGGCAGCC CGAGTACTTA TAATTGGCAG TGGAGGAAGG GAACATACGC 120 
TGGCCTGGAA ACTTGCACAG TCTCATCATG TCAAACAAGT GTTGGTTGCC CCAGGAAACG 180 
CAGGCACTGC CTGCTCTGAA AAGATTTCA A ATACCGCCAT CTCAATCAGT GACCACACTG 240 
CCCTTGCTCA ATTCTGCAAA GAGAAGAAAA TTGAATTTGT AGTTGTTGGA CCAGAAGCAC 300 
CTCTGGCTGC TGGGATTGTT GGGAACCTGA GGTCTGCAGG AGTGCAATGC TTTGGCCCAA 360 
CAGCAGAAGC GG CTCAGTTA GAGTCCAGCA AAAGGTTTGC CAAAGAGTTT ATGGACAGAC 420 
ATGGAATCCC AACCGCACAA TGGAAGGCTT TCACCAAACC TGAAGAAGCC TGCAGCTTCA 480 
TTTTGAGTGC AGACTTCCCT GCTTTGGTTG TGAAGGCCAG TGGTCTTGCA GCTGGAAAAG 540 
GGGTGATTGT TGCAAAGAGC AAAGAAGAGG CCTGCAAAGC TGTACAAGAG ATCATGCAGG 600 
AGAAAGCCTT TGGGGCAGCT GGAGAAACAA TTGTCATTGA AGAACTTCTT GACGGAGAAG 660 
AGGTGTCX3TG TCTGTGTTTC ACTGATGGCA AGACTGTGGC OOCCATGCCC CCAGCACAGG 720 
ACCATAAGCG ATTACTGGAG GGAGATGGTG GCCCTAACAC AGGGGGAATG GGAGCCTATT 780 
GTOCAGCOCC TCAGGTTTCT AATGATCTAT TACTAAAAAT TAAAGATACT GTTCTTCAGA 840 
GGACAGTGGA TGGCATGCAG CAAGAGGGTA CTCCATATAC AGGTATTCTC TATGCTGGAA 900 
TAATGCTGAC CAAGAATGGC CCAAAAGTTC TAGAGTTTAA TTGCCGTTTT GGTGATCCAG 960 
AGTGCCAAGT AATOCTCCCA CTTCTTAAAA GTGATCTTTA TGAAGTGATT CAGTCCACCT 1020 
TAGATGGACT GCTCTGCACA TCTCTGCCTG TTTGGCTAGA AAACCACACC GCCCTAACTG 1080 
TTGTCATGGC AAGTAAAGGT TATCCTGGAG ACTACACCAA GGGTGTAGAG ATAACAGGGT 1 140 
TTCCTGAGGC TCAAGCTCTA GGACTGGAGG TGTCCCATGC AGGCACTGCC CTCAAAAATG 1200 
GCAAAGTAGT AACTCATGGG GGTAGAGTTC TTGCAGTCAC AGCCATCCGG GAAAATCTCA 1260 
TATCAGCCCT TGAGGAAGCC AAGAAAGGAC TAGCTGCTAT AAAGTTTGAG GGAGCAATTT 1320 
ATAGGAAAGA CATCGGCTTT CGTGCCATAG CTTTCCTCCA GCAGCCCAGG TAAAACTCTA 1380 
AGCAAGTTAG CTGTAGTGCC ATTTCAOAAA CTGGCCTAAA TGGCTATGTA GAACATTCCA 1440 
TTAACCCTAT AAGTCATTCA GTATTCTTTT CTCTCTGTGG GAGTGATACA GTCTTGGTTT 1 500 
GTATTTTGTT TGAATCAAAA CTGGTTATAG CAATACTCAA ATGGAAAAAA CTTCATGATA 1560 
GCGTAAGTTT GGAAAGTTTA GCAAAATCAC AGTGGTACTG ATTTTTATTT GTTTTCTATT 1620 
TTTTTTATTT TATATTTTTA ATTTTTTTAA CAGGGTCTTC CTCTCTCGCC CAAGTTCTCA 1680 
TGCCTCAGCC TCCCAAATAG CTGGGACTAC AGGCACAGGC CACCACACCT GGCTAATTTT 1740 
TTTGTATTTT TTGTGGAGAT GGGGTTCACC ATGTTGCCAA GGCCAGTCTG AAAGCCTGGG 1 800 
CTCAAGTGAT CCTCCTGCTT TGGCCTCCCA AAATGCTGGG ACTATAGGCA TGAGGCX3CTG I860 
CACTTGGCCT GATACTGATT TTTATTCCTT GCGTTATCAC ATAGTGTTGT ATTTGAAACA 1920 
TAGTTCATGG TTTTATCAAA GAACTGAAGA TGAGAATACT GGTCATCTAA CTTTGTAATT 1980 
TGATTTGATT ATACTGTAAA GTTTGACAGT CCCATTTTAA CCTGCGTTTG TATCTATTAC 2040 
TAAAATGTAT TTTTTGACCT CTTACTGATT CATGGTTGGT ATGTACAAAC TGTTGACTTG 2100 
TAAAATCAAT AAAGTCTTAG TTGG 

Sen ID NO: 21 Protein sequence: 
Protein Accession #: BAA06809.1 



I 11 21 31 41 51 

I I I I I I 

MAARVLHGS GGREHTLAWK LAQSHHVKQV LVAPGNAGTA CSEKISNTAI SISDHTALAQ 60 
FCKEKKIEFV WGPEAPLAA GIVGNLRS AG VQCFGPTAEA AQLESSKRFA KEFMDRHGIP 120 
TAQWKAFTKP EEACSFILSA DFPALWKAS GLAAGKG VIV AKSKEEACKA VQEIMQEKAF 1 80 
GAAGETIVIE E1XDGEEVSC LCFTDGKTVA PMPPAQDHKR LLEGDGGPNT GGMGAYCPAP 240 
QVSNDLLLKI KDTVLQRTVD GMQQEGTPYT GZLYAGIMLT KNGPKVLEFN CRFGDPECQV 300 
ILPLLKSDLY EVIQSTLDGL LCTSLPVWLE NHTALTWMA SKGYPGDYTK GVEITGFPEA 360 
QALGLEVSHA GTALKNGK W THGGRVLAVT AIRENLISAL EEAKKGLAA1 KFEGAIYRKD 420 
IGFRAIAFLQ QPR 



Seq ID NO: 22 DNA sequence 

Nucleic Add Accession #: EOS cloned 

Coding sequence: 1-2424 



1 11 21 31 41 51 

I I I I I I 

ATGCCCCCTT TCCTGTTGCT GGAGGCCGTC TGTGTTTTGC TGTTTTCCAG AGTGCCCCCA 60 
TCTCTCCCTC TCCAGGAAGT CCATGTAAGC AAAGAAACCA TCGGGAAGAT TTCAGCTGCC 120 
AGCAAAATGA TGTGGTGCTC GGCTGCAGTG GACATCATGT TTCTGTTAGA TGGGTCTAAC 180 
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AGCGTCOGGA AAGQOAOCTT TGAAAGGTCC AAGCACTTTG CCATCACAGT CTGTGACGGT 240 
CTGGACATCA GCCCCGAGAG GGTCAGAGTG GGAGCATTCC AGTTCAGTTC CACTCCTCAT 300 
CTGGAATTCC CCTTGGATTC ATTTTCAACC CAACAGGAAG TGAAGGCAAG AATCAAGAGG 360 
ATGGTTTTCA AAGG AGGGCG CACGGAGACG GAACTTGCTC TGAAATACCT TCTGCACAGA 420 
GGGTTGCCTG GAGGCAGAAA TGCTTCTGTG CCCCAGATCC TCATCATCGT CACTGATGGG 480 
AAGTCCCAGG GGGATGTGGC ACTGCCATCC AAGCAGCTGA AGGAAAGGGG TGTCACTGTG 540 
TTTGCTGTGG GGGTCAGGTT TCCCAGGTGG GAGGAGCTGC ATGCACTGGC CAGCGAGCCT 600 
AGAGGGCAGC AOGTGCTGTT GGCTGAGCAG GTGGAGGATG CCAOCAACGG CCTCTTCAGC 660 
ACCCTCAGCA GCTCGGCCAT CTGCTCCAGC GCCACGCCAG ACTGCAGGGT CGAGGCTCAC 720 
CCCTGTGAGC ACAGGACGCT GGAGATGGTC OGGGAGTTCG CTGGCAATGC CCCATGCTGG 780 
AGAGGATCGC GGCGGACCCT TGCGGTGCTG GCTGCACACT GTCCCTTCTA CAGCTGGAAG 840 
AGAGTGTTCC TAACCCACCC TGCCACCTGC TACAGGACCA CCTGCCCAGG CCCCTGTGAC 500 
TCGCAGCCCT GCCAGAATGG AGGCACATGT GTTCCAGAAG GACTGGACGG CTACCAGTGC 960 
CTCTGCCCGC TGGCCTTTGG AGGGGAGGCT AACTGTGCCC TGAAGCTGAG CCTGGAATGC 1020 
AGGGTCGACC TCCTCTTCCT GCTGGACAGC TCTGCGGGCA CCACTCTGGA CGGCTTCCTG 1080 
CGGGCCAAAG TCTTCGTGAA GCGGTTTGTG CGGGCCGTGC TGAGCGAGGA CTCTCGGGCC 1140 
CGAGTGGGTG TGGCCACATA CAGCAGGGAG CTGCTGGTGG CGGTGCCTGT GGGGGAGTAC 1200 
CAGGATGTGC CTGACCTGGT CTGGAGCCTC GATGGCATTC CCTTOCGTGG TGGCCCCACC 1260 
CTGACGGGCA GTGCCTTGOG GCAGGCGGCA GAGCGTGGCT TCGGGAGCGC CACCAGGACA 1320 
GGCCAGGACC GGCCACGTAG AGTGGTGGTT TTGCTCACTG AGTCACACTC CGAGGATGAG 1380 
GTTGCGGGCC CAGCGCGTCA CGCAAGGGCG CGAGAGCTGC TCCTGCTGGG TGTAGGCAGT 1440 
GAGGCCGTGC GGGCAGAGCT GGAGGAG ATC ACAGGCAGCC CAAAGCATGT GATGGTCTAC 1500 
TCGGATOCTC AGGATCTGTT CAACCAAATC CCTGAGCTGC AGGGGAAGCT GTGCAGCCGG 1560 
CAGCGGCCAG GGTGCCGGAC ACAAGCCCTG GACCTCGTCT TCATGTTGGA CACCTCTGCC 1620 
TCAGTAGGGC CCGAGAATTT TGCTCAGATG CAGAGCTTTG TGAGAAGCTG TGCCCTOCAG 1680 
TTTGAGGTGA ACCCTGACGT GACACAGGTC GGCCTGGTGG TGTATGGCAG CCAGGTGCAG 1740 
ACTGCCTTCG GGCTGGACAC CAAACCCACC CGGGCTGOGA TGCTGCGGGC CATTAGCCAG 1800 
GCCCCCTACC TAGGTGGGGT GGGCTCAGCC GGCACCGCCC TGCTGCACAT CTATGACAAA 1 860 
GTGATGACOG TCCAGAGGGG TGCCCGGCCT GGTGTCCOCA AAGCTGTGGT GGTGCTCACA 1920 
GGCGGGAGAG GCGCAGAGGA TGCAGCCGTT CCTGCCCAGA AGCTGAGGAA CAATGGCATC 1980 
TCTGTCTTGG TCGTGGGCGT GGGGCCTGTC CTAAGTGAGG GTCTGCGGAG GCTTGCAGGT 2040 
CCCCGGGATT CCCTGATCCA CGTGGCAGCT TACGCCG AOC TGCGGTACCA CCAGGACGTG 2100 
CTCATTGAGT GGCTGTGTGG AGAAGCCAAG CAGCCAGTCA ACCTCTGCAA ACCCAGCCCG 2160 
TGCATGAATG AGGGCAGCFG CGTGCTGCAG AATGGGAGCT ACCGCTGCAA GTGTCGGGAT 2220 
GGCTGGGAGG GCCCCCACTG CGAGAACCGT GAGTGGAGCT CTTGCTCTGT ATGTGTGAGC 2280 
CAGGGATGGA TTCTTGAGAC GCCCCTGAGG CACATGGCTC CCGTGCAGGA GGGCAGCAGC 2340 
CGTACCCCTC CCAGCAACTA CAGAGAAGGC CTGGGCACTG AAATGGTGCC TACCTTCTGG 2400 
AATGTCTGTG CCCCAGGTCC TTAG 

Seq ID NO: 23 protein sequence: 
Protein Accession #: EOS cloned 



1 11 21 31 41 51 
I I I I I I 

MPPFLLLEAV CVFLFSRVPP SLPLQEVHVS KETIGKISAA SKMMWCSAAV DIMFLLDGSN 60 
SVGKGSFERS KHFAJTVCDG LDKPERVRV GAFQFSSTPH LEPPLDSFST QQEVKARHCR 120 
MVFKGGRTET ELALKYLLHR GLPGGRNASV PQILIIVTDG KSQGDVALPS KQLKERG VTV 180 
FAVGVRFPRW EELHALASEP RGQHVLLAEQ VEDATNGLFS TLSSSAICSS ATPDCRVEAH 240 
PCEHRTLEMV REFAGNAPCW RGSRRTLAVL AAHCPFYSWK RVFLTHPATC YRTTCPGPCD 300 
SQPCQNGGTC VPEGLDGYQC LCPLAFGGEA NCALKLSLEC RVDLLFLLDS SAGTTLDGFL 360 
RAKVFVKRFV RAVLSEDSRA RVGVATYSRE LLVAVPVGEY QDVPDLVWSL DGIPFRGGPT 420 
LTGSALRQAA ERGFGSATRT GQDRPRRVW LLTESHSEDE VAGPARHARA RELLLLGVGS 480 
EAVRAELEEI TGSPKHVMVY SDPQDLFNQI PELQGKLCSR QRPGCRTQAL DLVFMLDTSA 540 
SVGPENFAQM QSFVRSCALQ FEVNPDVTQV GLWYGSQVQ TAFGLDTKPT RAAMLRAISQ 600 
APYLGGVGSA GTALLHIYDK VMTVQRGARP GVPKAVWLT GGRGAEDAAV PAQKLRNNGI 660 
SVLWGVGPV LSEGLRRLAG PRDSUHVAA YADLRYHQDV LIEWLCGEAK QPVNLCKPSP 720 
CMNEGSCVLQ NGSYRCKCRD GWEGPHCENR EWSSCSVCVS QGWILETPLR HMAPVQEGSS 780 
RTPPSNYREG LGTEMVPTFW NVCAPGP 

Seq ID NO: 24 DNA sequence 

Nucleic Acid Accession »: see Table 25 & 25A for complete list 

1 11 21 31 41 51 

1 I I I I i 

AGGTCGGCTG GTTATCGGGA GTTGGAGGGC TGAGGTCGGG AGGGTGGTGT GTACAGAGCT 60 
CTAGGACTCA CGCACCAGGC CAGTCGCGGG TTTTGGGCCG AGGCCTGGGT TACAAGCAGC 120 
AAGTGCGCGG TTGGGGCCAC TGCGAGGCCG TTTTAGAAAA CTGTTTAAAA CAAAGAGCAA 180 
TTGATGGATA AATCAGGAAT AGATTCTCTT GACCATGTGA CATCTGATGC TGTGGAACTT 240 
GCAAATCGAA GTGATAACTC TTCTGATAGC AGCTTATTTA AAACTCAGTG TATCCCTTAC 300 
TCACCTAAAG GGG AGAAAAG AAACCOCATT CGAAAATTTG TTCGTACACC TGAAAGTGTT 360 
CACGCAAGTA TTCATCAAGT GACTCATCTT TTGAACCAGT ACCATTGACT ATAAAAGCTA 420 
TTTTTGAAAG ATTCAAGAAC AGGAAAAAGA GATATAAAAA AAAGAAAAAG AGGAGGTACC 480 
AGCCAACAGG AAGACCACGG GGAAGACCAG AAGGAAGGAG AAATCCTATA TACTCACTAA 540 
TAGATAAGAA GAAACAATTT AGAAGCAGAG GATCTGGCTT CCCATTTTTA GAATCAGAGA 600 
ATGAAAAAAA CGCACCTTGG AGAAAAATTT TAACGTTTGA GCAAGCTGTT GCAAGAGGAT 660 
TTTTTAACTA TATTGAAAAA CTGAAGTATG AACACCACCT GAAAG AATCA TTGAAGCAAA 720 
TGAATGTTGG TGAAGATTTA GAAAATGAAG ATTTTGACAG TCGTAGATAC AAATTTTTGG 780 
ATGATGATGG ATCCATTTCT OCTATTGAGG AGTCAACGCT TTTATCTTGA GGACATGGTG 840 
TCTGGAGTTA AAGGTATTGG CATACTCCAC ACATCTGTAC CATTCTTGAG TGATCGCTTA 900 
GGAATGAATG TGATTTGGAC TCATTCATGT ATGAGAGTAA GCAATGCTTT TTTTTCCAGG 960 
gtgtcaaatt GAGAACCAGG TAGATCCCCA CCACCTACAG TAAAAAGGAC CCTAAAGTAA 1020 
ATTGGTTGAA GAAATTAGAT CCCAAAGATT CTTGGTGAAT TTTGAAGTCT TCATCAGTAT 1080 
ATCCATATTA AAAOGAGATG ACAGAAGCCA AAGTAATTAT GGGCTGACAG GACAACTGGA 1140 
TCAGTTTCAT TAAAAAGGGC AAACTTGAAG ATAAATCTTT TGACTCCAGC TCTTTAGAGG 1200 
ATCTAAAGTG AOCTTGATGG ACAGTGGAAG AAATCACAAC ATGGAATTCC TCGAATAACA 1260 
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ATTTATTGAC TTTAAATAAT TTTGTCTAAT GCTACATATA CACAATTAAA AAACCTTTAC 1320 
ACTATTTCTA GAAAGTCAGC ATGTATTTTT GGCTCGAAGT TTCTCTAGTG TTTTCTGTGG 1380 
AAGGAATAAA AATTTGAGGT TTCAATACAA AAACAAAACA AACAACACGA AACACGAAAA 1440 
ACAATCTGTT GTGCGGCGCC CCTGGGCCCC TTGAGAGAAA ACTTTTTAGA ACCCCTTTTG 1500 
CGTTGTGGCG GCCCGGGOGC CCCACAGTTG GGTTTAGGTG GGCACCCTTG TGTCTACAAG 1560 
TGGTGTCTCC CCAAGAGAGA GAACACCTCC GGGGTCAAGC GGACAACAAG AGTGCGTCGT 1620 
GAGGACTCTT CACCCAAAGT ATATAAAACC CGCOCCGCGG GGGAACCACC GGCOGCTTTT 1680 
CTGTAGACAC AACCCCCACA GTGGGAACCT CTGAGGGCGC ACACACAGGG CGAGCCTTAT 1740 
CAACAAGGGG TGOCCAACAG AAACCCCGAG TTAAAAATCG 



Scq ID NO: 25 DNA sequence 

Nucleic Acid Accession #: BO00 1972. 1 

Coding sequence: 183-1019 

1 11 21 31 41 51 

I I I I I I 

GGTCGGCTGG TTATCGGGAG TTGGAGGGCT GAGGTCGGGA GGGTGGTGTG TACAGAGCTC 60 
TAGGACTCAC GCACCAGGCC AGTCGCGGGT TTTGGGCCGA GGCCTGGGTT ACAAGCAGCA 120 
AGTGCGCGGT TGGGGCCACT GCGAGGCCGT TTTAGAAAAC TGTTTAAAAC AAAGAGCAAT 180 
TGATGGATAA ATCAGGAATA GATTCTCTTG ACCATGTGAC ATCTGATGCT GTGGAACTTG 240 
CAAATCGAAG TGATAACTCT TCTGATAGCA GCTTATTTAA AACTCAGTGT ATCCCTTACT 300 
CACCTAAAGG GGAGAAAAGA AACCCCATTC GAAAATTTGT TCGTACACCT GAAAGTGTTC 360 
ACGCAAGTGA TTCATCAAGT GACTCATCTT TTGAAOCAAT ACCATTGACT ATAAAAGCTA 420 
TTTTTGAAAG ATTCAAQAAC AGGAAAAAGA GATATAAAAA AAAGAAAAAG AGGAGGTACC 480 
AGCCAACAGG AAGACCACGG GGAAGACCAG AAGGAAGGAG AAATCCTATA TACTCACTAA 540 
TAGATAAG AA GAAACAATTT AG AAGCAGAG GATCTGGCTT CCCATTTTTA G AATCAG AGA 600 
ATGAAAAAAA CGCACCTTGG AGAAAAATTT TAACGTTTGA GCAAGCTGTT GCAAGAGGAT 660 
TTTTTAACTA TATTGAAAAA CTGAAGTATG AACACCACCT GAAAGAATCA TTGAAGCAAA 720 
TGAATGTTGG TGAAGATTTA GAAAATGAAG ATTTTGACAG TCGTAGATAC AAATTTTTGG 780 
ATGATGATGG ATCCATTTCT CCTATTGAGG AGTCAACAGC AGAGGATGAG GATGCAACAC 840 
ATCTTGAAGA TAACGAATGT GATATCAAAT TGGCAGGGGA TAGTTTCATA GTAAGTTCTG 900 
AATTCCCTGT AAGACTGAGT GTATACTTAG AAGAAGAGGA TATTACTGAA GAAGCTGCTT 960 
TGTCTAAAAA GAGAGCTACA AAAGCCAAAA ATACTGGACA GAGAGGCCTG AAAATGTGAC 1020 
AGGATCATGA ATGTCAAAGG CTTTTATCTT GAGAACATGG TGTCTGGAGT TAAAGGACTA 1080 
TTGTTAGATC TGTGGGAAGG AATTACAAGA CAGTTGCTAA AAGTTTGAAA AAGACGGTTG 1140 
CTAAACGTTA TGAAAAACCA GATAATCTAC TTTTTTACCT TAGGTATTGG CATACTCCAC 1200 
ACATCTGTAC CATTCTTGAG TGATCGCTTA GGAATGAATG TGATTTGAAC TCATTCATGT 1260 
TGAGAGGGTG TCAAATTGAG AACCAGGTAG ATCCCCACCA CCTACAGTAA AAAGGACCCT 1320 
AAAGTAAATT GGTTGAAGAA ATTAGATCCC AAAGATTCTT GGTGAATTTT GAAGTCTTCA 1380 
TCAGTATATC CATATTAAAA CGAGATGACA GAAGCCAAAG TAATTATGGG CTGACAGGAC 1440 
AACTGGATCA GTTTCATTAA AAAGGGCAAA CTTGAAGATA AATCTTTTGA CTCCAGCTCT 1500 
TTAGAGGATC TAAAGTGACC TTGATGGACA GTGGAAGAAA TCACAACATG GAATTCCTCG 1560 
AATAACAATT TATTGACTTT AAATAATTTT GTCTAATGCT ACATATACAC AATTAAAAAA 1 620 
CCTTTACACT AAAAAAAAAA AAAAAA 

Scq ID NO: 26 Protein sequence: 
Protein Accession #: AAH01972.1 

1 11 21 31 41 51 

1)1111 

MDKSGIDSLD HVTSDAVELA NRSDNSSDSS LFKTQCIPYS PKGEKRNPIR KFVRTPESVH 60 
ASDSSSDSSF EPIPLTIKAI FERFKNRKKR YKKKKKRRYQ PTGRPRGRPE GRRNPIYSLI 120 
DKKKQFRSRG SGFPFLESEN EKNAPWRKU TFEQAVARGF FNYIEKLK.YE HHLKESLKQM 180 
NVGEDLENED FDSRRYKFLD DDGSISPIEE STAEDEDATH LEDNECDIKL AGDSFIVSSE 240 
FPVRLSVYLE EEDITEEAAL SKKRATKAKN TGQRGLKM 



Seq ID NO: 27 DNA sequence 

Nucleic Acid Accession #: AK027016 

Coding sequence; 207-1043 

1 U 21 31 41 51 

I I I I I I 

CTTTTCTTCC GCACGGTTGG AGGAGGTCGG CTGGTTATCG GGAGTTGGAG GGCTGAGGTC 60 
GGGAGGGTGG TGTGTACAGA GCTCTAGGAC TCACGCACCA GGCCAGTCGC GGATTTTGGG 120 
CCGAGGCCTG GGTTACAAGC AGCAAGTGCG CGGTTGGGGC CACTGCGAGG CCGTTTTAGA 180 
AAACTGTTTA AAACAAAGAG CAATTGATGG ATAAATCAGG AATAGATTCT CTTGACCATG 240 
TGACATCTGA TGCTGTGGAA CTTGCAAATC GAAGTGATAA CTCTTCTGAT AGCAGCTTAT 300 
TTAAAACTCA GTGTATCCCT TACTCACCTA AAGGGGAGAA AAGAAACCCC ATTCGAAAAT 360 
TTGTTCGTAC ACCTGAAAGT GTTCACGCAA GTGATTCATC AAGTGACTCA TCTTTTGAAC 420 
CAATACCATT GACTATAAAA GCTATTTTTG AAAGATTCAA GAACAGGAAA AAGAGATATA 480 
AAAAAAAGAA AAAG AGGAGG TACCAGCCAA CAGGAAGACC ACGGGGAAGA CCAGAAGGAA 540 
GGAGAAATCC TATATACTCA CTAATAGATA AGAAGAAACA ATTTAGAAGC AGAGGATCTG 600 
GCTTCCCATT TTTAGAATCA GAGAATGAAA AAAACGCACC TTGGAGAAAA ATTTTAACGT 660 
TTGAGCAAGC TGTTGCAAGA GGATTTTTTA ACTATATTGA AAAGCTGAAG TATGAACACC 720 
ACCTGAAAG A ATCATTG AAG CAAATG AATG TTGGTGAAGA TTTAGAAAAT GAAGATTTTG 780 
ACAGTCGTAG ATACAAATTT TTGGATGATG ATGGATCCAT TTCTCCTATT GAGG AGTCAA 840 
CAGCAGAGGA TGAGGATGCA ACACATCTTG AAGATAACGA ATGTGATATC AAATTGGCAG 900 
GGGATAGTTT CAT AGTAAGT TCTG AATTCC CTGTAAGACT GAGTGTAT AC TTAG AAG AAG 960 
AGGATATTAC TGAAGAAGCT GCTTTGTCTA AAAAGAGAGC TACAAAAGCC AAAAATACTG 1020 
GACAGAGAGG CCTGAAAATG TGAC AGGATC ATGAATGTCA AAGGCTTTTA TCTTGAGAAC 1080 
ATGGTGTCTG GAGTTAAAGG TATTGGCATA CTCCACACAT CTGTACCATT CTTGAGTGAT 1 140 
cgcttaggaa TGAATGTGAT ttgaactcat TCATGTTGAG AGGGTGTCAA ATTGAGAACC 1200 
AGGTAGATCC CCACCACCTA CAGTAAAAAG GACCCTAAAG TAAATTGGTT GAAGAAATTA 1260 
GATCCCAAAG ATTCTTGGTG AATTTTGAAG TCTTCATCAG TATATCCATA TTAAAACGAG 1320 
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ATGACAGAAG CCAAAGTAAT TATGGCAAGT AATGGTTTTT ATCTTAACTA TAAGTTATTT 1380 
GCTCAAGGGT GTAATGGTCA TTACCAAGGC TTTTAGAATG CAGTTTCTCA TTTGCTGTGG 1440 
ACATGACCAT AAAAAAAAAT TTCCCAGTAG GTTTTCTATC TGCTACGTTG CTAGCAATCA 1500 
GCTTATTGGG AACAGTTGAT TAACTGTAAT AGAAATGCAA TACAAATAAA ATGTGAACCA 1560 
CATGTGATTT TTCTTTAAAA TCAGTGAGAT TTGAAAATTC TCCTAGATCT CTTGAATCAT 1620 
GCAAATTTGC TTTGCCTTTA TATTGTAACC CTTGTGGGTT GCTAATAACC AAGCAGTTTG 1680 
TAGTAGAGTT AACTCAGGCT CGTTCTAGGG ACTCATTCAT GTTCACTCAC TGTACACTCA 1740 
TCTCTGGAAA TGTAAAATTT ACTTTTATAC TATTGTTATG TAGGGCTGAC AGGACAACTG 1800 
GATCAGTTTC ATTAAAAAGG TATGTATGCA TTAGAAAAGA CATTTGTATG GGTCATTTCA 1860 
AAGAGGGCTT ATGAGGCTGT GAAACCCAGA GCTCTTAACG CTGTGACCAA AGATGGAAGT 1920 
TCTCTATAGG AAGCCATAGC ACTCCTAATG TTTGGTGCTA TGTTTTCCTG AGGAGATATA 1980 
AAACGTAATA ATCCATGATT GTTGCCATGT OAGAGTTTTA AAGGTTAATC AAAATTTCTC 2040 
TTCTTCAGGG CAAACTTGAA GATAAATCTT TTGACTCCAG CTCTTTAGAG GATCTAAAGT 2100 
GACCTTGATG GACAGTGGAA GAAATCACAA CATGGAATTC CTCGAATAAC AATTTATTGA 2160 
CTTTAAATAA TTTTGTCTAA TGCTACATAT ACACAATTAA AAAACCTTTA CACTATTTCT 2220 
AGAAAGTCAG GATGTATTTT TGGCTCGAAG TTTCTCTAGT GTTTTCTGTG GAAGGAATAA 2280 
AAATTTGAGT TTCAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAA 

Seq ID NO: 28 Protein sequence: 
Protein Accession ft BAB15628.1 

I II 21 31 41 SI 
I I ! I I I 

MDKSOIDSLD HVTSDAVELA NRSDNSSDSS LFKTQCIPYS PKGEKRNPIR KFVRTPESVH 60 
ASDSSSDSSF EPIPLTIKA1 FERFKNRKKR YKKKKKRRYQ PTGRPRGRPE GRRNPIYSU 120 
DKKKQFRSRG SGFPFLESEN EKNAPWRKIL TFEQAVARGF FNYIEKLKYE HHLKESLKQM 180 
. NVGEDLENED FDSRRYKFLD DDGSISPIEE STAEDEDATH LEDNECDHCL AGDSFIVSSE 240 
FPVRLSVYLE EEDITEEAAL SKKRATKAKN TGQRGLKM 



Sep ID NO; 29 DNA sequence 

Nucleic Acid Accession ft NMJXM289J 

Coding sequence: 493-1695 

1 11 21 31 41 51 

I I I I I I 

GCCGCCGCCT CGTCCACCGG AGGAGCCGGC GCCAGCGTGG ACGGCGGCAG CCAGGCTGTG 60 
CAGGGGGGCG GCGGGGACCC CCGAGCGGCT CGGAGTGGCC CCTTGGACGC CGGGGAAGAG 120 
GAGAAGGCAC CCGCGGAACC GACGGCTCAG GTGCCGGACX5 CTGGCGGATG TGCGAGCGAG 180 
GAGAATGGGG TACTAAGAGA AAAGCAOGAA GCTGTGGATC ATAGTTCCCA GCATGAGGAA 240 
AATGAAGAAA GGGTGTCAGC OCAGAAGGAG AACTCACTTC AGCAG AATGA TGATGATGAA 300 
AACAAAATAG CAG AGAAACC TGACTGGGAG GCAGAAAAG A CCACTGAATC TAG AAATG AG 360 
AGACATCTGA ATGGGACAGA TACTTCTTTC TCTCTGGAAG ACTTATTCCA GTTGCTTTCA 420 
TCACAGOCTG AAAATTCACT GGAGGGCATC TCATTGGGAG ATATTCCTCT TCCAGGCAGT 480 
ATCAGTGATG GCATG AATTC TTCAGCACAT TATCATGTAA ACTTC AGCCA GGCTATAAGT 540 
CAGGATGTGA ATCTTCATGA GGCCATCTTG CTTTGTCCCA ACAATACATT TAGAAGAGAT 600 
CCAACAGCAA GGACTTCACA GTCACAAGAA CCATTTCTGC AGTTAAATTC TCATACCACC 660 
AATCCTGAGC AAACCCITCC TGGAACTAAT TTGACAGGAT TTCTTTCACC GGTTGACAAT 720 
CATATGAGGA ATCTAACAAG CCAAGACCTA CTGTATGACC TTGACATAAA TATATTTGAT 780 
GAGATAAACT TAATGTCATT GGCCACAGAA GACAACTTTG ATCCAATCGA TGTTTCTCAG 840 
CTTTTTGATG AACCAGATTC TGATTCTGGC CTTTCTTTAG ATTCAAGTCA CAATAATACC 900 
TCTGTCATCA AGTCTAATTC CTCTCACTCT GTGTGTGATG AAGGTGCTAT AGGTTATTGC 960 
ACTGACCATG AATCTAGTTC CCATCATGAC TTAGAAGGTG CTGTAGGTGG CTACTAOCCA 1020 
GAACCCAGTA AGCTTTGTCA CTTGGATCAA AGTGATTCTG ATTTCCATGG AGATCTTACA 1080 
TTTCAACACG TATTTCATAA CCACACTTAC CACTTACAGC CAACTGCACC AGAATCTACT 1 140 
TCTGAACCTT TTCCGTGGCC TGGGAAGTCA CAG AAGATAA GGAGTAGATA CCTTGAAGAC 1200 
ACAGATAGAA ACTTG AGCCG TGATGAACAG CGTGCTAAAG CTTTGCATAT CCCTTTTTCT 1260 
GTAG ATG AAA TTGTCGGCAT GOCTGTTGAT TCTTTCAATA GCATGTTAAG TAGATATTAT 1320 
CTGACAGACC TACAAGTCTC ACTTATCCGT GACATCAGAC GAAGAGGGAA AAATAAAGTT 1380 
GCTGCGCAGA ACTGTCGTAA ACGCAAATTG GACATAATTT TGAATTTAGA AGATGATGTA 1440 
TGTAACTTGC AAGCAAAGAA GGAAACTCTT AAGAGAGAGC AAGCACAATG TAACAAAGCT 1500 
ATTAACATAA TGAAACAGAA ACTGCATGAC CTTTATCATG ATATTTTTAG TAGATTAAGA 1560 
GATGACCAAG GTAGGCCAGT CAATOOCAAC CACTATGCTC TCCAGTGTAC CCATGATGGA 1620 
AGTATCTTG A TAGTACCCAA AG AACTGGTG GCCTCAGGCC ACAAAAAGGA AACCCAAAAG 1 680 
GGAAAGAGAA AGTGAGAAGA AACTG AAGAT GGACTCTATT ATGTGAAGTA GTAATGTTCA 1740 
GAAACTGATT ATTTGGATCA GAAACCATTG AAACTGCTTC AAGAATTGTA TCTTTAAGTA 1800 
CTGCTACTTG AATAACTCAG TTAACGCTGT TTTGAAGCTT ACATGGACAA ATGTTTAGGA 1860 
CTTCAAGATC ACACTTGTGG GCAATCTGGG GGAGCCACAA CTTTTCATGA AGTGCATTGT 1920 
ATACAAAATT CATAGTTATG TCCAAAGAAT AGGTTAACAT GAAAACCCAG TAAGACTTTC 1980 
CATCTTGGCA GCCATCCTTT TTAAGAGTAA GTTGGTTACT TCAAAAAG AG CAAACACTGG 2040 
GGATCAAATT ATTTTAAGAG GTATTTCAGT TTTAAATGCA AAATAGCCTT ATTTTCATTT 2100 
AGTTTGTTAG CACTATAGTG AGCTTTTCAA ACACTATTTT AATCTTTATA TTTAACTTAT 2160 
AAATTTTGCT TTCT 



R^q TP NO: 30 Protein sequence: 
Protein Accession ft NP_0O428O 

1 U 21 31 41 51 

I I I I I I 

MNSSAHYHVN FSQAISQDVN LHEAILLCPN NTFRRDPTAR TSQSQEPFLQ LNSHTTNPEQ 60 
TLPGTNLTGF LSPVDNHMRN LTSQDLLYDL DINIFDEINL MSLATEDNFD PIDVSQLFDE 120 
PDSDSGLSLD SSHNNTSVDC SNSSHSVCDE GAIGVCTDHE SSSHHDLEGA VGGYYPEPSK 180 
LCHLDQSDSD FHGDLTFQHV FHNHTVHLQP TAPESTSEPF PWPGKSQKIR SRYLEDTDRN 240 
LSRDEQRAKA LHIPFSVDEI VGMPVDSFNS MLSRYYLTDL QVSURDERR RGKNKVAAQN 300 
CRKRKLDUX NLEDDVCNLQ AKKETLKREQ AQCNKAINIM KQKLHDLYHD IFSRLRDDQG 360 
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RPVNPNHYAL QCTHDGSILI VPKELVASGH KKBTQKGKRK 



Seq ID NO: 31 DNA sequence 

Nucleic Acid Accession #: NM 033260.1 

Coding sequence; 1-1208 

1 11 21 31 41 51 
I I 1 I I I 

ATGAAGTTGG AGGTGTTCGT CCCTCGCGCG GCCCAOGGGG ACAAGCAGGG CAGTGACCTG 60 
GAGGGCGCGG GCGGCAGCGA CGCGCCGTCC COGCTGTCGG OGGCGGGAGA CGACTCCCTG 120 
GGCTCAGATG GGGACTGCGC GGCCAAGCCG TCCGCGGGCG GCGGCGCCAG AGATACGCAG 180 
GGCGACGGCG AACAGAGTGC GGGAGGCGGG CCGGGCGCGG AGGAGGCGAT OCCGGCAGCA 240 
GCTGCTGCAG CGGTGGTGGC GGAGGGCGCG GAGGCCGGGG CGGCGGGGCC AGGCGCGGGC 300 
GGCGCGGGGA GCGGCGAGGG TGCACGCAGC AAGCCATATA CGCGGCGGCC CAAGOCCOCC 360 
TACTCGTACA TCGCGCTCAT CGCCATGGCC ATCCGCGACT CGGO GGGCGG GCGCTTGACG 420 
CTGGCGGAGA TCAACGAGTA CCTCATGGGC AAGTTCCCCT TTTTCCGCGG CAGCTACACG 480 
GGCTGGCGCA ACTOCGTGCG CCACAACCTT TCGCTCAACG ACTGCTTCGT CAAGGTGCTG 540 
CGCGACCCCT CGCGGCCCTG GGGCAAGGAC AACTACTGGA TGCTCAACCC CAACAGCGAG 600 
TACACCTTCG CCGACGGGGT CTTCCGCCGC CGCCGCAAGC GCCTCAGCCA CCGCGCGCCG 660 
GTCCCCGCGC CCGGGCTGCG GCCCGAGGAG GCCCCGGGCC TCCCCGCCGC CCCGCCGCCC 720 
GCGCCCGCCG CCCCGGCCTC GCCCCGCATG CGCTCGCCCG CCCGCCAGGA GGAGCGCGCC 780 
AGCCCCGCGG GCAAGTTCTC CAGCTCCTTC GCCATCGACA GCATCCTGCG CAAGCCCTTC 840 
CGCAGCCGTC GCCTCAGGGA CACGGCCCCC GGGACGACGC TTCAGTGGGG CGCCGCGCCC 900 
TGCCCGCCGC TGCCCGCOTT CCCCGCGCTC CTCCCCGCGG CGCCCTGCAG GGCOCTGCTG 960 
CCGCTCTGCG CGTACGGCGC GGGCGAGCCG GCGCGGCTGG GCGCGCGCGA GGCCGAGGTG 1020 
CCACCGACCG CGCCGCCCCT CCTGCTTGCA CCTCTCCCGG CGGCGGCCCC CGCCAAGCCA 1080 
CTCCGAGGCC CGGCGGCCGG CGGCGCGCAC CTGTACTGCC CCCTGCGGCT GCCCGCAGCC 1 140 
CTGCAGGCGG CCTTAGTCCG NCGTCCTGGC CCGCACCTGT CGTACCOGGT GGAGACGCTC 1200 
CTAGCTTGA - 

Seq ID NO: 32 Protein Effluence: 
Protein Accession #: NPJ50285.1 

1 11 21 31 41 51 

I I I I I I 

MKLEVFVPRA AHGDKQGSDL EGAGGSDAPS PLSAAGDDSL GSDGDCAAKP SAGGGARDTQ 60 
GDGEQSAGGG PGAEEAIPAA AAAAWAEGA EAGAAGPGAG GAGSGEGARS KPYTRRPKPP 120 
YSYIALIAMA IRDSAGGRLT LAEINEYLMG KFPFFRGSYT GWRNSVRHNL SLNDCFVKVL 1 80 
RDPSRPWGKD NYWMLNPNSE YTFADGVFRR RRKRLSHRAP VPAPGLRPEE APGLPAAPPP 240 
APAAPASPRM RSPARQEERA SPAGKFSSSF AIDSELRKPF RSRRLRDTAP GTTLQWGAAP 300 
CPPLPAFPAL LPAAPCRALL PLCAYGAGEP ARLG AREAEV PPTAPPLLLA PLPAAAPAKP 360 
LRGPAAGGAH LYCPLRLPAA LQAALVRRPG PHLSYPVETL LA 

Seq ID NO: 33 WteSSOSSSS. 

Nucleic Acid Accession #: NM_012 128.2 

Coding sequence: 43-2796 

1 11 21 31 41 51 

I I I I I 1 

GAACAAACCA ACATTTGAGC CAGGAATAAC TAGAGAGGAA CAATGGGGTT ATTCAGAGGT 60 
TTTGTTTTCC TCTTAGTTCT GTGCCTGCTG CACCAGTCAA ATACTTCCTT CATTAAGCTG 120 
AATAATAATG GCTTTGAAQA TATTGTCATT GTTATAGATC CTAGTGTGCC AGAAGATGAA 180 
AAAATAATTG AACAAATAGA GGATATGGTG ACTACAGCTT CTACGTACCT GTTTG AAGCC 240 
ACAGAAAAAA G A1 1 1 11111 CAAAAATGTA TCTATATTAA TTCCTGAGAA TTGQAAGGAA 300 
AATCCTCAGT ACAAAAGGCC AAAACATGAA AACCATAAAC ATGCTGATGT TATAGTTGCA 360 
CCACCTACAC TCCCAGGTAG AGATGAACCA TACACCAAGC AGTTCACAGA ATGTGGAGAG 420 
AAAGGCGAAT ACATTCACTT CACCCCTGAC CTTCTACTTG GAAAAAAACA AAATGAATAT 480 
GGACCACCAG GCAAACTGTT TGTCCATGAG TGGGCTCACC TCCGGTGGGG AGTGTTTGAT 540 
GAGTACAATG AAGATCAGCC TTTCTACCGT GCTAAGTCAA AAAAAATCGA AGCAACAAGG 600 
TGTTCCGCAG GTATCTCTGG TAGAAATAGA GTTTATAAGT GTCAAGGAGG CAGCTGTCTT 660 
AGTAGAGCAT GCAGAATTGA TTCTACAACA AAACTGTATG GAAAAGATTG TCAATTCTTT 720 
CCTGATAAAG TACAAACAG A AAAAGCATCC ATAATGTTTA TGCAAAGTAT TGATTCTGTT 780 
GTTGAATTTT GTAACGAAAA AACCCATAAT CAAG AAGCTC CAAGCCTACA AAACATAAAG 840 
TGCAATTTTA GAAGTACATG GGAGGTGATT AGCAATTCTG AGGATTTTAA AAACACCATA 900 
CCCATGGTGA CACCACCTCC TCCACCTGTC TTCTCATTGC TGAAGATCCG TCAAAGAATT 960 
GTGTGCTTAG TTCTTGATAA GTCTGGAAGC ATGGGGGGTA AGGACCGCCT AAATCGAATG 1020 
AATCAAGCAG CAAAACATTT CCTGCTGCAG ACTGTTGAAA ATGGATCCTG GGTGGGGATG 1080 
GTTCACTTTG ATAGTACTGC CACTATTGTA AATAAGCTAA TCCAAATAAA AAGCAGTGAT 1 140 
GAAAGAAACA CACTCATGGC AGGATTACCT ACATATCCTC TGGGAGGAAC TTCCATCTGC 1200 
TCTGGAATTA AATATGCATT TCAGGTGATT GGAGAGCTAC ATTCCCAACT CGATGGATCC 1260 
GAAGTACTGC TGCTGACTGA TGGGGAGGAT AACACTGCAA GTTCTTGTAT TGATGAAGTG 1320 
AAACAAAGTG GGGCCATTGT TCATTTTATT GCTTTGGGAA G AGCTGCTGA TGAAGCAGTA 1380 
ATAGAGATGA GCAAGATAAC AGGAGGAAGT CATTTTTATG TTTCAGATGA AGCTCAGAAC 1440 
AATGGCCTCA TTGATGCTTT TGGGGCTCTT ACATCAGGAA ATACTGATCT CTCCCAGAAG 1500 
TCCCTTCAGC TCGAAAGTAA GGGATTAACA CTGAATAGTA ATGCCTGGAT GAACGACACT 1560 
GTCATAATTG ATAGTACAGT GGGAAAGGAC ACGTTCTTTC TCATCACATG GAACAGTCTG 1620 
CCTCCCAGTA TTTCTCTCTG GGATCCCAGT GGAACAATAA TGGAAAATTT CACAGTGGAT 1 680 
GCAACTTCCA AAATGGCCTA TCTCAGTATT CCAGGAACTG CAAAGGTGGG CACTTGGGCA 1740 
TACAATCTTC AAGCCAAAGC GAACCCAGAA ACATTAACTA TTACAGTAAC TTCTCGAGCA 1800 
GCAAATTCTT CTGTGCCTOC AATCACAGTG AATGCTAAAA TGAATAAGGA CGTAAACAGT 1860 
TTCCCCAGCC CAATGATTGT TTACGCAGAA ATTCTACAAG GATAT GTACC TGTTCTTGGA 1920 
GCCAATGTGA CTGCTTTCAT TGAATCACAG AATGGACATA CAGAAGTTTT GGAACTTTTG 1980 
GATAATGGTG CAGGCGCTG A TTCTTTCAAG AATGATGGAG TCTACTCCAG GTATTTTACA 2040 
GCATATACAG AAAATGGCAG ATATAGCTTA AAAGTTCGGG CTCATGGAGG AGCAAACACT 2100 
GCCAGGCTAA AATTACGGCC TCCACTGAAT AGAGCCGCGT ACATACCAGG CTGGGTAGTG 2160 
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AACGGGGAAA TTGAAGCAAA CCCGCCAAGA CCTGAAATTG ATGAGGATAC TCAGACCACC 2220 
TTGGAGGATT TCAGCCG AAC AGCATCCGGA GGTGCATTTG TGGTATCACA AGTCCCAAGC 2280 
CTTCCCTTGC CTGACCAATA CCCACCAAGT CAAATCACAG ACCTTGATGC CACAGTTCAT 2340 
GAGG ATAAGA TTATTCTTAC ATGGACAGCA CCAGGAGATA ATTTTGATGT TGGAAAAGTT 2400 
CAAOGTTATA TCATAAGAAT AAGTGCAAGT ATTCTTGATC TAAGAGACAG TTTTGATGAT 2460 
GCTCTTCAAG TAAATACTAC TGATCTGTCA CCAAAGGAGG CCAACTCCAA GGAAAGCTTT 2520 
GCATTTAAAC CAGAAAATAT CTCAGAAGAA AATGCAACCC ACATATTTAT TGCCATTAAA 2580 
AGTATAGATA AAAGCAATTT GACATCAAAA GTATCCAACA TTGCACAAGT AACTTTGTTT 2640 
ATCCCTCAAG CAAATCCTG A TGACATTGAT CCTACTCCTA CTCCTACTOC TACTCCTGAT 2700 
AAAAGTCATA ATTCTGGAGT TAATATTTCT ACGCTGGTAT TGTCTGTGAT TGGGTCTGTT 2760 
GTAATTGTTA ACTTTATTTT AAGTACCACC ATTTG AACCT TAACG AAGAA AAAAATCTTC 2820 
AAGTAGACCT AGAAGAGAGT TTTAAAAAAC AAAACAATGT AAGTAAAGGA TATTTCTGAA 2880 
TCTTAAAATT CATCCCATGT GTGATCATAA ACTCATAAAA ATAATTTTAA GATGTCGGAA 2940 
AAGGATACTT TG ATTAAATA AAAACACTCA TGGATATGTA AAAACTGTCA AGATTAAAAT 3000 
TTAATAGTTT CATTTATTTG TTATTTTATT TGTAAGAAAT AGTGATG AAC AAAGATCCTT 3060 
TTTCATACTG ATACCTGGTT GTATATTATT TGATGCAACA GTTTTCTGAA ATGATATTTC 3120 
AAATTGCATC AAGAAATTAA AATCATCTAT CTGAGTAGTC AAAATACAAG TAAAGGAGAG 3180 
CAAATAAACA ACATTTGGAA AAAAAAAAAA AAAAAAAA. 

Sea ID NO. 34 Protein sequence: 
Protein Accession P. NP_03 6260.1 

1 11 21 31 41 51 

I I I I I I 

MGLFRGFVFL LVLCLLHQSN TSFIKLNNNG FEDIV1VIDP SVPEDEKIIE QIEDMVTTAS 60 
TYLFEATEKR FFFKNVSILI PENWKENPQY KRPKHENHKH ADVIVAPPTL PGRDEPYTKQ 120 
FTECGEKGEY MFTPDLLLG KKQNEYGPPG KLFVHEWAHL RWGVFDEYNE DQPFYRAKSK 1 80 
KIEATRCSAG ISGRNRVYKC QGGSCLSRAC RIDSTTKLYG KDCQFFPDKV QTEKASIMFM 240 
QSIDSWEFC NEKTHNQEAP SLQNBCCNFR STWEVISNSE DFKNTIPMVT PPPPPVFSLL 300 
BORQRIVCLV LDKSGSMGGK DRLNRMNQAA KHFLLQTVEN GSWVGMVHFD STATTVNKLI 360 
QKSSDERNT LMAGLPTYPL GGTSICSGIK YAFQVIGELH SQLDGSEVLL LTDGEDNT AS 420 
SODEVKQSG ATVHFIALGR AADEAVffiMS KITGGSHFYV SDEAQNNGLI DAFGALTSGN 480 
TDLSQKSLQL ESKGLTLNSN AWMNDTVED STVGKDTFFL ITWNSLPPSI SLWDPSGTIM 540 
ENFTVDATSK MA YLSIFGT A KVGTWAYNLQ AKANPETLTI TVTSRAANSS VPPTTVN AKM 600 
NKDVNSFPSP MIVYAEILQG YVPVLGANVT AFIESQNGHT EVLELLDNGA GADSFKNDGV 660 
YSRYFTAYTE NGRYSLKVRA HGGANTARLK LRPPLNRAAY IPGWWNGEI EANPPRPEID 720 
EDTQTTLEDF SRTASGGAFV VSQVPSLPLP DQYPPSQITD LDATVHBDKI ILTWTAPGDN 780 
FDVGKVQRYI IRISASILDL RDSFPDALQV NTTDLSPKEA NSKESFAFKP ENISEENATH 840 
IFIAKSIDK SNLTSKYSNI AQVTLFIPQA NPDDIDPTPT PTPTTDKSHN SGVNISTLVL 900 
SV1GSWIVN FILSTTI 

Seq ID Na 35 fiijIAjsgHSGSS 

Nucleic Add Accession #: NMJD0O901.1 

Coding sequence: 217-3171 

1 II 21 31 41 51 

I I I I I I 

CGCGGGAGCC AACTTCAGGC TGCTCAGAGG AAGCCCGTGC AGTCAGTCAC CTGGGTGCAA 60 
GAGCGTTGCT GCCTCGGGCT CTCCCGCTGC AGGGAGAGCG GCACTCGCTG GCCTGGATGT 120 
GGTTGGATTT AGGGGGGCTC CGCAGCAGGG GTTTCGTGGC GGTGGCAAGC GCTGCAACAG 180 
GTAGACGGCG AGAGACGGAC CCCGGCCGAG GCAGGGATGG AGACCAAAGG CTACCACAGT 240 
CTCCCTGAAG GTCTAGATAT GGAAAGACGG TGGGGTCAAG TTTCTCAGGC TGTGGAGCGT 300 
TCTTCCCTGG GACCTACAGA GAGGACCGAT GAGAATAACT ACATGGAGAT TGTCAACGTA 360 
AGCTGTGTTT CCGGTGCTAT TCCAAACAAC AGTACTCAAG GAAGCAGCAA AGAAAAACAA 420 
GAACTACTCC CTTGCCTTCA GCAAGACAAT AATCGGCCTG GGATTTTAAC ATCTGATATT 480 
AAAACTGAGC TGGAATCTAA GGAACTTTCA GCAACTGTAG CTGAGTCCAT GGGTTTATAT 540 
ATGGATTCTG TAAGAGATGC TGACTATTCC TATGAGCAGC AGAACCAACA AGGAAGCATG 600 
AGTCCAGCTA AGATTTATCA GAATGTTGAA CAGCTGGTGA AATTTTACAA AGGAAATGGC 660 
CATCGTCCTT CCACTCTAAG TTGTGTGAAC ACGCCCTTGA GATCATTTAT GTCTGACTCT 720 
GGGAGCTCCG TGAATGGTGG CGTCATGCGC GCCATTGTTA AAAGCCCTAT CATGTGTCAT 780 
GAG AAAAGCC CGTCTGTTTG CAGCCCTCTG AACATGACAT CTTCGGTTTG CAGCCCTGCT 840 
GGAATCAACT CTGTGTCCTC CACCACAGCC AGCTTTGGCA GTTTTCCAGT GCACAGCCCA 900 
ATCACCCAGG GAACTCCTCT GACATGCTCC CCTAATGCTG AAAATCGAGG CTCCAGGTCG 960 
CACAGCCCTG CACATGCTAG CAATGTGGGC TCTCCTCTCT CAAGTCCGTT AAGTAGCATG 1020 
AAATCCTCAA TTTCCAGCCC TCCAAGTCAC TGCAGTGTAA AATCTCCAGT CTCCAGTCCC 1080 
AATAATGTCA CTCTGAGATC CTCTGTGTCT AGCCCTGCAA ATATTAACAA CTCAAGGTGC 1140 
TCTGTTTCCA GCCCTTCG AA CACTAATAAC AGATCCACGC TTTCCAGTCC GGCAGCCAGT 1200 
ACTGTGGGAT CTATCTGTAG CCCTGTAAAC AATGCCTTCA GCTACACTGC TTCTGGCACC 1260 
TCTGCTGGAT CCAGTACATT GCGGGATGTG GTTCCCAGTC CAOACACGCA GGAGAAAGGT 1320 
GCTCAAGAGG TCCCTTTTCC TAAGACTGAG GAAGTAGAGA GTGCCATCTC AAAT GGTGTG 1380 
ACTGGCCAGC TTAATATTGT CCAGTACATA AAACCAGAAC CAGATGGAGC TTTTAGCAGC 1440 
TCATGTCTAG G AGGAAATAG C AAAATAAAT TCGG ATTCTT CATTCTCAGT ACC AATAAAG 1500 
CAAGAATCAA CCAAGCATTC ATGTTCAGGC ACCTCTTTTA AAGGGAATCC AACAGTAAAC 1560 
CCGTTTCCAT TTATGGATGG CTCGTATTTT TCCTTTATGG ATGATAAAGA CTATTATTCC 1620 
CTATCAGGAA TTTTAGGACC ACCTGTGCCC GGCTTTGATG GTAACTGTGA AGGCAGCGG A 1 680 
TTCCCAGTGG GTATTAAACA AGAACCAGAT GACGGGAGCT ATTACCCAGA GGCCAGCATC 1740 
CCTTCCTCTG CTATTGTTGG GGTGAATTCA GGTGGACAGT CCTTCCACTA CAGGATTGGT 1800 
GCTCAAGGTA CAATATCTTT ATCACGATCG GCTAGAGACC AATCTTTCCA ACACCTGAGT 1860 
TCCTTTCCTC CTGTCAATAC TTTAGTGGAG TCATGGAAAT CACACGGCGA CCTGTCGTCT 1920 
AGAAGAAGTG ATGGGTATCC GGTCTTAGAA TACATTCCAG AAAATGTATC AAGCTCTACT 1980 
TTACGAAGTG TTTCTACTGG ATCTTCAAGA CCTTCAAAAA TATGTTTGGT GTGTGGGGAT 2040 
GAGGCTTCAG GATGCCATTA TGGGGTAGTC ACCTGTGGCA gctgcaaagt TTTCTTCAAA 2100 
AGAGCAGTGG AAGGGCAACA CAACTATTTA TGTGCTGGAA GAAATGATTG CATCATTGAT 2160 
aagattcgac g aaagaattg tcctgcttgc agacttcaga AATGTCTTCA AGCTGGAATG 2220 

AATTTAGGAG CACGAAAGTC AAAGAAGTTG GGAAAGTTAA AAGGGATTCA CGAGGAGCAG 2280 
CCACAGCAGC AGCAGCCCCC ACCCCCACCC CCACCCCCGC AAAGCCCAGA GGAAGGGACA 2340 



253 



WO 02/068677 



PCT/US02/06001 



ACOTACATCG CTOCTGCAAA AGAACCCTCG GTCAACACAO CACTGGTTCC TCAGCTCTCC 2400 
ACAATCTCAC GAGCGCTCAC ACCTTCCCCC GTTATGGTCC TTGAAAACAT TGAACCTGAA 2460 
ATTGTATATG CAGGCTATGA CAGCTCAAAA CCAGATACAG CCGAAAATCT GCTCTCCACG 2520 
CTCAACCGCT TAGCAGGCAA ACAGATGATC CAAGTCGTGA AGTGGGCAAA GGTACTTCCA 2580 
GGATTTAAAA ACTTGCCTCT TGAGGACCAA ATTACCCTAA TCCAGTATTC TTGGATGTGT 2640 
CTATCATCAT TTGOCTTGAG CTGGAGATCG TACAAACATA CGAACAGCCA ATTTCTCTAT 2700 
TTTGCACCAG ACCTAGTCIT TAATGAAGAG AAGATGCATC AGTCTGCCAT GTATGAACTA 2760 
TGCCAGGGGA TGCACCAAAT CAGCCITCAG TTCGTTCGAC TGCAGCTCAC CTTTGAAGAA 2820 
TACACCATCA TGAAAGTTTT GCTGCTACTA AGCACAATTC CAAAGGATGG CCTCAAAAGC 2880 
CAGGCTGCAT TTGAAGAAAT GAGGACAAAT TACATCAAAG AACTGAGGAA GATGGTAACT 2940 
AAGTGTCCCA ACAATTCTGG GCAG AGCTGG CAGAGGTTCT ACCAACTGAC CAAGCTGCTG 3000 
GACTCCATGC ATGACCTGGT GAGCGACCTG CTGGAATTCT GCTTCTACAC CTTCCGAGAG 3060 
TCCCATGCGC TGAAGGTAGA GTTCCCCGCA ATGCTGGTGG AGATCATCAG CGACCAGCTG 3 120 
CCCAAGOTGG AGTCGGGGAA CGCCAAGCCG CTCTACTTCC ACCGGAAGTG ACTGOCCGCT 3180 
GCCCAGAAGA ACTTTGCCTT AAGTTTCCCT GTGTTGTTCC ACACCCAGAA GGACCCAAGA 3240 
AAACCTGTTT TTAACATGTG ATGGTTGATT CACACTTGTT CAACAGTTTC TCAAGTTTAA 3300 
AGTCATGTCA GAGGTTTGGA GCCGGGAAAG CT G'UIU CC GTGGATTTGG CGAGACCAGA 3360 
GCAGTCTGAA GGATTCCCCA CCTCCAATCC CCCAGCGCTT AGAAACATGT TCCTGTTCCT 3420 
CGGGATGAAA AGCCATATCT AGTCAATAAC TCTGATTTTG ATATTTTCAC AGATGGAAGA 3480 
AGTTTTAACT ATGCCGTGTA GTTTCTGGTA TCGTTCGCTT GTTTTAAAAG GGTTCAAGGA 3540 
CTAACGAACG TTTTAAAGCT T ACCCTTGGT TTGCACATAA AACGTATAGT CAATATGGGG 3600 
CATTAATATT CTTTTGTTAT TAAAAAAACA CAAAAAAATA ATAAAAAAAT ATATACAGAT 3660 
TCCTGTTGTG TAATAACAGA ACTCGTGGCG TGGGGCAGCA GCTGCCTCTG AGOCCTCGCT 3720 
CGTCCAOGGT CTTCTGCATC ACTGGTATAC ACACTCGTTA GCGTCCATTT CTTATTTAAT 3780 
TAGAATGGAT AAGATGATGT TAAATGCCTT GGTTTGATTT CTAGTATCTA TTGTGTTGGC 3840 
TTTACAAATA ATTTTTTGCA GTCTTTTGCT GTGCTGTACA TTACTGTATG TATAAATTAT 3900 
GAAGGACCTG AAATAAGGTA TAAGG ATCTT TTGTAAATGA GACACATACA AAAAAAATCT 3960 
TTAATGGTTA ATAGGATGAA TGGGAAAGTA TTTTTGAAAG AATTCTATTT TGCTGGAGAC 4020 
TATTTAAGTA CTATCTTTGT CTAAACAAGG TAATTTTTTT TTGTAAAGTG CAATGTCCTG 4080 
CATGCATAAT GAACCGTTTA CAGTGTATTT AAGAAAGGGA AAGCTGTGCC TTTTTTAGCT 4140 
TCATATCTAA TTTACCATTA TTTTACAGTC TCTG TTGTAA ATAA CCACAC TGAAACCTCT 4200 
TCGGTTGTCT TGAAAOCTTT CTACTTT1 1C TGTACTTTTT GTTTTGTTCT TGGTCTCCXX3 4260 
CTTGGGGCAT TTGTGGGACT CCAGCACGTT TTCTGGCTTC TGCTTCATCC TGCTCCATCG 4320 
GGGAATGACA CACTGCGGTG TCTGCAGCTC CTGGAAGGTG TCATTTGACA ACACATGTGG 4380 
GAGAGGAGGT CCTTGGAGTG CTGCAGCTTT GGGAAAGOCT GOCTOGTTTC CCTTTTOCTC 4440 
TAGAAGCAGA AOCAGCTCTA CGAGAGTGAG ACTGGGAACT TGATGGCTCA GAGAGCATCT 4500 
TTTCCTCCCA TTTTAG AAAA TCAGATTTTC TCCTGTGGGA AAAAAAAATT CCATGCACTC 4560 
TCTCTCTGTT AAAGATCAGC TATTCCCTTC TGATCTTGGA AAGAGGTTCT GCACTCCTGG 4620 
AACCGGTCAC AGGAACGCAC AG ATCATGGC AGGATGCGCT GGGACGGCCC ATCTTGGCAA 4680 
GGTTCAGTCT GAATGGCATG GAGACCGGGA GATAGAGGGG TTTTAGATTT TTAAAAGGTA 4740 
GGTTTTAAAA ATAAGTTTTA TACATAAACA GTTTTGGAGA AAAATTACAG ATCATATAAG 4800 
CAAGACAGTG GCACTAAAAT GTTTAATTCA TTAATCTOTT TGTTTGGCAC TGATGCAATG 4860 
TATGGCrnT CTCTTGCCCC AAATCACAAA CATATGTATC TTTGGGGAAA CTAACAATAT 4920 
GATTGCACTA AATAAACTAC TTTGAATAGA GGCCAAATTA ATCTTTTAAA AATGATGATA 4980 
ATCATCAGGT TTACTCAGTG AAATCATATT AATTATTTTC CAAAATCTAA AAGCTGTAGC 5040 
TGGAGAAGCC CATGGCCACG AGGAAGGAGC AATTAATTAG ATCAACACTT TTCTOCAGGG 5100 
TTCACCATGC AGGCAACATT ACCTTGTCTT TCAAAAGACA CCTGCCTTAG TGCAAGGGGA 5160 
AACCTGTGAA AGCTGCACTC AGAGGGAGGA GTCTTTCTTA CATAATTTGC AATTTCAGGA 5220 
ATTTAATTTA TAGGCAGATC TTTAAATACA GTCAACTTAC GGTGCACAGT AATATGAAAG 5280 
CCACACTTTG AAGGTAATAA ATACACAGCA TGCAGACTGG GAGTTGCTAG CAAACAAATG 5340 
GCTTACTTAC AAAAGCAGCT TTTAGTTCAG ACTTAGTTTT TATAAAATGA GAATTCTGAC 5400 
TTACTTAACC AGGTTTGGGA TGGAGATGGT CTGCATCAGC TTTTTGTATT AACAAAGTTA 5460 
CTGGCTCTTT GTGTGTCTCC AGGTAACTTT GCTTGATTAA ACAGCAAAGC CATATTCTAA 5520 
ATTCACTGTT GAATGCCTGT CCCAGTCCAA ATTGTCTGTC TGCTCTTATT TTTGTACCAT 5580 
ATTGCTCTTA AAAATCTTGG TTTGGTACAG TTCATAATTC ACCAAAAAGT TCATATAATT 5640 
TAAAGAAAGA CTAAATTAGT TTAAAATGAA GCAATTTATA TCTTTATGCA AAAACATATG 5700 
TCTGTCTTTG CAAAGG ACTG TAAGCAGATT ACAATAAATC CTTTACTTT 



Seq ID NO: 36 Protein sequence: 
Protein Accession #: NP 000892.1 

1 11 21 31 41 51 

I I I I I I 

METKGYHSLP EGLDMERRWG QVSQA VERSS LGPTERTDEN NYMETVNVSC VSGAIPNNST 60 
QGSSKEKQBL LPCLQQDNNR PGILTSDIKT ELESKELSAT VAESMGLYMD SVRDADYSYE 120 
QQNQQGSMSP AKIYQNVEQL VKFYKGNGHR PSTLSCVNTP LRSFMSDSGS SVNGGVMRAI 1 80 
VKSPIMCHEK SPSVCSPLNM TSSVCSPAGI NSVSSTTASF GSFPVHSPH QGTPLTCSPN 240 
AENRGSRSHS PAHASNVGSP LSSPLSSMKS SISSPPSHCS VKSPVSSPNN VTLRSSVSSP 300 
ANINNSRCSV SSPSNTNNRS TLSSPAASTV GSICSPVNNA FSYTASGTSA GSSTLRDWP 360 
SPDTQEKGAQ EVPFPKTEEV ESAISNGVTG QLNTVQY1KP EPDGAFSSSC LGGNSKINSD 420 
SSFSVPKQE STKHSCSGTS FKGNPTVNPF PFMDGSYFSF MDDKDYYSLS GILGPPVPGF 480 
DGNCEGSGFP VGKQEPDDG SYYPEASIPS SATVGVNSGG QSFHYRIGAQ GTTSLSRSAR 540 
DQSFQHLSSF PPVNTLVESW KSHGDLSSRR SDGYPVLEYI PENVSSSTLR SVSTGSSRPS 600 
KICLVCGDBA SGCHYGWTC GSCKVFFKRA VEGQHNYLCA GRNDCDDK1 RRKNCPACRL 660 
QKCLQAGMNL GARKSKKLGK LKGIHEEQPQ QQQPPPPPPP PQSPEEGTTY IAPAKEPSVN 720 
TALVPQLSTI SRALTPSPVM VLENIEPEIV YAGYDSSKPD TAENLLSTLN RLAGKQMIQV 780 
VKWAKVLPGF KNLPLEDQIT UQYSWMCLS SFALSWRSYK HTNSQFLYFA PDLVFNEEKM 840 
HQSAMYELCQ GMHQISLQFV RLQLTFEEYT IMKVLLLLST IPKDGLKSQA AFEEMRTNYI 900 
KELRKMVTKC PNNSGQSWQR FYQLTKLLDS MHDLVSDLLE FCFYTFRESH ALKVEFPAML 960 
VEHSDQLPK VESGNAKPLY FHRK 

ID NO: 37 DNA segn^ c*. 
Nucleic Acid Accession #: see Table 25 & 25 A for complete list 
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TC^^apI J^^^^ C ^ TC AGCTCCCAAG TAGCTGGGAT TACAGGTGCA 60 
rr^^S^ TGTATTnTA GTAOAGATGG TTTTCACTAT GTTGGCCAGG 120 

CTGATCTTGA ATTCCTGGCC TGAAGTAATC TGCCTGCCTC AGCCTCCCAA AGTOCTGGGA 1RD 
V^^^^^P SOCATAACTG GTATTTTTT A TaSSota 2« 
i^™ AT TACTCTGTTG TTTCTTCCTT TTTTTTTTTT TnTGGCTTT GACCAATTTG 300 



ivr-A i^i^ivji ju iiih ilc 1 1 rriTTTTTTT TTTTGGCTTT GACC 

rSvS^?^ GTATCTCC TA CCTAGAAAAA AAACACACTA AACAGTAAAT GATTACCAAC 360 

^^^^^C^jScTG^Itmn^^^^^ ^^^^^ CACAAGTGT A. TCAAATT AOT ~ 



SailDNO.38 DNAscauencc 

Nucleic Acid Accession ft NM 001192.1 

Coding sequence: 219-773 

' 11 21 31 41 51 
I I 



^^I?^ EH AGAAACT TQ AATTAGAT GTGGTATTCA AATCCTTACG TGCCQCGAAG 60 
a^H^ 0 CCCCCGTAAG AACCCACGAA GCAGGCGAAG TTCATTGTO T^CA^S 120 
^2£^^°CTG^TTTG CTCTGGAATT CTTGTAGAGA TATTACTTGT CCTTCCAGgT.80 

m^^^J^^J^^"^ tocatgcttg CATACCTTOT CAACTTCGAT 300 
^^^J^?^ 0 " CTAACATGTC AGCGTTATTG TAATGCAAGT GTGACCAATT 350 

S™^^T^° ATrCTCIQQA ocrcrrrGGG actgagctta ataatttctt So 

T°°$^?7TTT C ° T ° < ^ AAT0 TmTGCTAA GGAAGATAAG CTCTGAACCA TTAAAGGACG 480 
of IS™ ™ C ^I% GGTCTCCTGG GCATGGCTAA CATTGACCTG G AAAAGAGGA 540 
^1°^ IQAAATTATTCTTCCGAGAG GCCTCGAGTA CACGGTGGAA GAATGCACCT 600 
GTGAAGACTG CATCAAGAGC AAACCGAAGG TOGACTCTGA CCATTGCTTT CCACTCCCAG 660 
S A J°S^ A AQGC OCAACC ATTCTTGTCA CCACGAAAAC 0 AATGACTAT TGCAaSaGCC 720 
Z25^ G £F^^^^AGTGCTACGGAGATAG AGAAATCAATTTCTXjCTAGGTAAPTAACCA 780^ 
mCGACTCG AGCAGTGCCA CTTTAAAAAT CTnTGTCAG AATAGATGAT GTGTCAGATC 840 
S^fJIPZ^IH IZCAGTTCCC GATACAGCTT TTTCTCCTCT AACTGTGGAA M T 



ScqIDNO:39£ _ 

Protein Accession ft NP001 183.1 

1 » 21 31 41 51 
1 I I I I I 

^^S^*^ 81 -" 1 * ^CQLRCSS NTPPLTCQRY CNASVTOSVK GtNAlLWTCL 60 

VTVEECTCED CDCSKPKVBS DHCFPLPAME EGATILVTTK TNDYCKSLPA ALSA^S 180 
ISAR 



ScqJDmiAOmA&SSSm 

Nuclac Add Accession ft NM 025087.1 

Coding sequence: 1 83-2282 

1 11 21 31 41 51 

££23^ CCGATGATCT ATTACTTTCC TTTGCAAACACTAGAACTTCA 3TO 

CTGGGCTTGA AGOTTTTAGT ATAGCATTTC TTTCTCCAAT ATTCCTAACA ATCACTCCTT 360 
TCTGGAAATT GGTTAACAAG AAGTGGATGC TAACCCTGCT GAGGA^TC ACTA^CA 470 
G ^I A S A < ^ CAaG CT CCAAATGCCA AACTTCGACTGATGGTT^T^C^nTGG^G 480 
IS^^f A J? GATA0T0 CAAGCTGTGA CTTGGTGGTC AGGAAGTCAT TOCAAAGGT 540 
ACCTCAGAAT TTGGGGATTC ATTTTAGGAC AGATTGTTCT TGTTGTTCTA CGCATATGGT fitm 
ATACrrCACT AAACCCAATC TGGAGTTATC AGATGTCCAA CA^GTOATACTGACArTAA ^60 

^^S^J^^S'^ ACGOOaATGG CCTCTAGACC CAACTGGCTC CTOGCAGGGG 780 
CTGCTTTTGG TAGCCTTGTG TTCCTCACCC AC7XJGGTTTT TGGAGAAGTC 1CTCTTOTTT mo 
^ A 5 A ^ C A ° T OiGTGGG CATCCACATC CAGGGCC^A^^S^A^rsoo 
CAGTACTGCT GTGCTTGGCA AGTGGATTGA TGCTTCCATC TTGTrTGTGGTn"C<3TGGTA 960 

SSSE^SSSS? 0 " acaggaacag cttcagct^ <^cS?SSc\ irao 

^^rn ESSSIH GGCTGTGTCT TCGCCATCTT TACTGCATO ATGTOGCCcF X 
AAACACTTGG ACACCTTATT AACTCAGGGA CAAACCCTGG GAAAACCATG ACTj^rr* li Jn 
TGATATTTTA TCTTCTAGAA ATATmTCT GTGCCTGGTG CACAGO^ A^GmOTCC , ^ 
™^? G J^ A ^ CTAGA OAAAGATCAG ATGTGCm^GO^TC S^A 
TS??^^ J^S^EJ GCTCCTAAGA AAAACCTTCA TTTGCTTCrr oC^CAAAAA 1 320 
^CTnTC AGAAAGAGTG AAAAATACAT aAAACmTTCTOTGCCmC 380 
TTGTTGGTGT GGOATTGTTG GGATTAGGAC TACGGCATAA AGCCTATGAG AOAAAjSmo 1440 
?r*ATiri inn rSnmS!?*^ GAGGTCTCTG CTOCCATCTO wlciTT^GG TTTGGAXM^ ISM 
A ™t^°S G S CTAGT CTAGAAAGAT CACKTCACCTOCTC^TOAAACAGG^A^ 1560 
ATTTCATAAC AATTTTGGAG AGTGATCCTTCTAAGCCCTATATGGTO/^CAAT^CTT^ 1620 
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CCATGTGGCT AGGGGAAAAG TTGGGTTTCT ATACAQACTT TQGTCCAAGC ACAAGGTATC 1680 
ACACTTGGGG GATTATGGCT TTGTCAAGAT ACCCAATTGT GAAATCTGAG CATCACCTTC 1740 
TTCCGTCACC AGAGGGCGAG ATCGCACCAG CCATCACATT GACCGTTAAC ATTTCGGGCA 1800 
AGCTGGTGGA TTTTGTCGTG ACACACTTTG GGAACCACGA AGATGACCTC GACAGGAAAC 1860 
TGCAGGCTAT TGCTGTTTCA AAACTACTGA AAAGTAGCTC TAATCAAGTG ATATTTCTGG 1920 
GATATATCAC TTCAGCACCT GCCTCCAGAG ATTATCTACA GCTCACTGAA CATGGCAATG 1980 
TGAAGGATAT CGACAGCACT GATCATGACA GATGGTGTGA ATACATTATG TATCGAGGGC 2040 
TGATCAGGTT GGGTTATGCA AGAATCTCCC ATGCTGAACT GAGTGATTCA GAAATTCAGA 2100 
TGGCAAAATT TAGGATCCCT GATGACCCCA CTAATTATAG AGACAACCAG AAAGTGGTCA 2160 
TAGACCACAG AGAAGTTTCT GAG AAAATTC ATTTTAATCC CAGATTTGGA TCCTACAAAG 2220 
AAGGACACAA TTATGAAAAC AACCATAATT TTCATATGAA TACTCCCAAA TACTTTTTAT 2280 
GAAACATTTA AAACAAGAAG TTATTGGCTG GGAAAATCTA AGAAAAAAAG TATGTAAGAT 2340 
AAAAAGAAGA GATTAATGAA AGTGGG AAAA TACACATGAA GAACCTCAAC TTAAAAAACA 2400 
CATGGTATCT ATGCAGTGGG AAATTACCTC CATTTGTAAA CTATGTTGCT TAATAAAAAC 2460 
ATTTCTCTAA AAAAAAAAAA AAAAAA 

Scq ID NO: 41 Protein sequence: 
Protein Accession #: NPJD79363.1 



1 11 21 31 41 51 
I I I I I I 

MTSLWREILL ESLLGCVSWS LYHDLGPMIY YFPLQTLELT GLEGFSIAFL SPIFLTITPF 60 
WKLVNKKWML TLLRHTIGS IASFQAPNAK LRLMVLALGV SSSUVQAVT WWSGSHLQRY 120 
LRIWGF1LGQ IVLWLRIWY TSLNPIWSYQ MSNKVILTLS AIATLDRIGT DGDCSKPEEK. 1 80 
KTGEVATGMA SRPNWLLAGA AFGSLVFLTH WVFGEVSLVS RWAVSGHPHP GPDPNPFGGA 240 
VLLCLASGLM LPSCLWFRGT GLIWWVTGTA SAAGLLYLHT WAAA VSGCVF AIFTASMWPQ 300 
TLGHLINSGT NPGKTMTIAM DFYLLEEFFC AWCTAFKFVP GGVYARBRSD VLLGTMMLI1 360 
GLNMLFGPKK NLDLLLQTKN SSKVLFRKSE KYMKLFLWLL VGVGLLGLGL RHKAYERKLG 420 
KVAPTKEVS A AIWPFRFGYD NEGWSSLERS AHLLNETGAD FTHLESDAS KPYMGNNDLT 480 
MWLGEKLGFY TDFGPSTRYH TWGIMALSRY PIVKSEHHLL PSPEGBIAPA ITLTVNISGK 540 
LVDFWTHFG NHEDDLDRKL Q AIAVSKLLK SSSNQVIFLG YITSAPGSRD YLQLTEHGNV 600 
KDIDSTDHDR WCEYIMYRGL IRLGYARISH AELSDSEIQM AKFRIPDDFT NYRDNQKWI 660 
DHREVSEJQH FNPRFGSYKE GHNYENNHNF HMNTPKYFL 
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It is understood that the examples described above in no way serve to limit the 
true scope of this invention, but rather are presented for illustrative purposes. All 
publications, sequences of accession numbers, and patent applications cited in this 
specification are herein incorporated by reference as if each individual publication or patent 
application were specifically and individually indicated to be incorporated by reference. 
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WHAT IS CLAIMED IS: 

1 1 . A method of detecting a metastatic colorectal cancer-associated 

2 transcript in a cell from a patient, the method comprising contacting a biological sample from 

3 the patient with a polynucleotide that selectively hybridizes to a sequence at least 80% 

4 identical to a sequence as shown in Tables 1-26. 

1 2. The method of claim 1, wherein the biological sample comprises 

2 isolated nucleic acids. 

1 3. The method of claim 1, wherein the polynucleotide is labeled. 

1 4. The method of claim 1 , wherein the polynucleotide is immobilized on 

2 a solid surface. 

1 5. An isolated nucleic acid molecule consisting of a polynucleotide 

2 sequence as shown in Tables 1 -26. 

1 6. An expression vector comprising the nucleic acid of claim 5. 

1 7. A host cell comprising the expression vector of claim 6. 

1 8. An isolated polypeptide which is encoded by a nucleic acid molecule 

2 having polynucleotide sequence as shown in Tables 1-26. 

1 9. An antibody that specifically binds a polypeptide of claim 8. 

1 10. The antibody of claim 10, which is an antibody fragment. 

1 11. The antibody of claim 10, which is a humanized antibody 

1 12. A method of detecting a metastatic colorectal cancer cell in a 

2 biological sample from a patient, the method comprising contacting the biological sample 

3 with an antibody of claim 9. 

1 13. The method of claim 12, wherein the antibody is labeled. 

1 14. A method of detecting antibodies specific to metastatic colorectal 

2 cancer in a patient, the method comprising contacting a biological sample from the patient 

3 with a polypeptide encoded by a nucleic acid comprises a sequence from Tables 1-26. 
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15. A method for identifying a compound that modulates a metastatic 
colorectal cancer-associated polypeptide, the method comprising the steps of: 

(i) contacting the compound with a metastatic colorectal cancer-associated 
polypeptide, the polypeptide encoded by a polynucleotide that selectively hybridizes to a 
sequence at least 80% identical to a sequence as shown in Tables 1-26.; and 

(ii) determining the functional effect of the compound upon the polypeptide. 

' 16. The method of claim 15, wherein the functional effect is determined by 
measuring ligand binding to the polypeptide. 

1 7. A method of inhibiting proliferation of a metastatic colorectal cancer- 
associated cell to treat colorectal cancer in a patient, the method comprising the step of 
administering to the subject a therapeutically effective amount of a compound that modulates 
a polypeptide encoded by a sequence as shown in Tables 1-26. 

18. A drug screening assay comprising the steps of 

(i) administering a test compound to a mammal having colorectal cancer or a 
cell isolated therefrom; 

(ii) comparing the level of gene expression of a polynucleotide that selectively 
hybridizes to a sequence at least 80% identical to a sequence as shown in Tables 1-26. in a 
treated cell or mammal with the level of gene expression of the polynucleotide in a control 
cell or TTifmrnipl, wherein a test compound that modulates the level of expression of the 
polynucleotide is a candidate for the treatment of colorectal cancer. 

19 . A pharmaceutical composition for treating a mammal having colorectal 
cancer, the composition comprising a compound identified by the assay of claim 18 and a 
physiologically acceptable excipient. 

20. A method of detecting a metastatic colorectal cancer-associated 
polypeptide in a cell from a patient, the method comprising contacting a biological sample 
from the patient with a antibody that that specifically binds a polypeptide encoded by a 
nucleic acid molecule having polynucleotide sequence as shown in Tables 1-26. 

21. The method of claim 21, wherein the antibody is labeled. 
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